Abstract

Linear receivers are often used to reduce the implementation complexity of multiple antenna systems. In a
traditional linear receiver architecture, the receive antennas are used to separate out the codewords sent by each
transmit antenna, which can then be decoded individually. Although easy to implement, this approach can be highly
sub-optimal when the channel matrix is near singular. This paper develops a new linear receiver architecture that
uses the receive antennas to create an effective channel matrix with integer-valued entries. Rather than attempting
to recover transmitted codewords directly, the decoder recovers integer combinations of the codewords according
to the entries of the effective channel matrix. The codewords are all generated using the same linear code, which
guarantees that these integer combinations are themselves codewords. If the effective channel is full rank, these
integer combinations can then be digitally solved for the original codewords. This paper focuses on the special case
where there is no coding across transmit antennas. In this setting, the integer-forcing linear receiver significantly
outperforms traditional linear architectures such as the decorrelator and MMSE receiver. In the high SNR regime,
the proposed receiver attains the optimal diversity-multiplexing tradeoff for the standard MIMO channel. It is further
shown that in an extended MIMO model with interference, the integer-forcing linear receiver achieves the optimal
generalized degrees-of-freedom.

Index Terms

MIMO, linear receiver architectures, linear codes, lattice codes 

I. Introduction

It is by now well-known that increasing the number of antennas in a wireless system can significantly increase
capacity. Since the seminal papers of Foschini and Gans [1] and Telatar [2], multiple-input multiple-output (MIMO)
channels have been thoroughly investigated in theory (see [3] for a survey) and implemented in practice [4]. This
capacity gain usually comes at the expense of more complex encoders and decoders, and a great deal of work
has gone into designing low-complexity MIMO architectures. In this paper, we describe a new low-complexity
architecture that can attain significantly higher rates than existing solutions of similar complexity.

We focus on the case where each of the M transmit antennas encodes an independent data stream (see Figure
1). That is, there is no coding across the transmit antennas: each data stream $\mathbf{w}_m$ is encoded separately to form a
codeword of length n. Channel state information is only available to the receiver. From the receiver's perspective,
the original data streams are coupled in time through encoding and in space (i.e., across antennas) through the MIMO
channel. The joint maximum likelihood (ML) receiver simultaneously performs joint decoding across time and
receive antennas. Clearly, this is optimal in terms of both rate and probability of error. However, the computational
complexity of jointly processing the data streams is high, and it is difficult to implement this type of receiver
in wireless systems when the number of streams is large. Instead, linear receivers such as the decorrelator and
minimum-mean-squared error (MMSE) receiver are often used as low-complexity alternatives [5].

Traditional linear receivers first separate the coupling in space by performing a linear projection at the front-end
of the receiver. In order to illustrate this concept and motivate the proposed new approach, we consider the 2 x 2
MIMO channel characterized by the following matrix:



$$\mathbf{H} = \begin{bmatrix} 2 & 1 \\ 1 & 1 \end{bmatrix}. \tag{1}$$



The simplest choice of a linear receiver front-end inverts the channel matrix. This receiver is usually referred to as 
the decorrelator in the literature. That is, the receiver first applies the matrix 



$$\mathbf{H}^{-1} = \begin{bmatrix} 1 & -1 \\ -1 & 2 \end{bmatrix} \tag{2}$$



to the received signal. Overall, this converts the original channel into a new equivalent channel characterized
by the identity matrix and colored Gaussian noise, i.e., to two scalar channels with correlated noise. The linear
receiver then proceeds by separately decoding the output of each of these two channels. The well-known drawback
of this approach is that the noise vector is also multiplied by the linear receiver front-end matrix given in (2),
which alters the variances of its components. In our simple example, if we assume that the original channel had
independent additive white Gaussian noise of unit variance, the equivalent noises after the linear receiver front-end
have variances of 2 and 5, respectively. That is, while the receiver front-end has nulled out cross-interference, it
has also significantly increased the noise levels.
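
As a quick numerical check of this example, the following minimal sketch (an illustration, not part of the original analysis) inverts the matrix in (1) exactly and computes the noise variance at each decorrelator output, assuming unit-variance i.i.d. noise at the receive antennas. The helper names `inverse_2x2` and `row_noise_variances` are hypothetical.

```python
from fractions import Fraction


def inverse_2x2(h):
    """Exact inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = h
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]


def row_noise_variances(front_end):
    """Variance of each output when unit-variance i.i.d. noise passes
    through the linear front-end: the squared norm of each row."""
    return [sum(entry * entry for entry in row) for row in front_end]


H = [[2, 1], [1, 1]]          # example channel from (1)
H_inv = inverse_2x2(H)        # decorrelator front-end from (2)
variances = row_noise_variances(H_inv)
print(H_inv)
print(variances)
```

The squared row norms of the front-end matrix are exactly the per-stream noise variances quoted in the text.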

The integer-forcing linear architecture advocated here is based on the recent insight that if the same linear or
lattice code is used on all transmit antennas, then it is possible to decode not only the codewords themselves but also
integer linear combinations of codewords directly [6]. Let us denote the codeword transmitted on the first antenna by $\mathbf{x}_1$
and the codeword transmitted on the second antenna by $\mathbf{x}_2$. Then, for the simple example matrix from (1), the
receiver can decode the integer linear combination $2\mathbf{x}_1 + \mathbf{x}_2$ from the first receive antenna and the combination
$\mathbf{x}_1 + \mathbf{x}_2$ from the second receive antenna. From these (following [6]), it is possible to recover linear equations of
the data streams over an appropriate finite field, $2\mathbf{w}_1 + \mathbf{w}_2$ and $\mathbf{w}_1 + \mathbf{w}_2$. These equations can in turn be digitally
solved for the original data streams. The key point in this example is that the noise variances remain unchanged,
which increases the effective SNR per data stream. Note that for more general channel matrices beyond the simple
example here, it will also be advantageous to first apply an appropriate linear receiver front-end, albeit following
principles very different from merely inverting the channel matrix, as we explain in more detail in the sequel.

In this paper, we first consider the standard MIMO channel and develop a new integer-forcing linear receiver
architecture that provides multiplexing and diversity gains over traditional linear architectures. Our approach relies
on the compute-and-forward framework, which allows linear equations of transmitted messages to be efficiently
and reliably decoded over a fading channel [6]. We develop a multiple antenna version of compute-and-forward
which employs the antennas at the receiver to rotate the channel matrix towards an effective channel matrix with
integer entries. Separate decoders can then recover integer combinations of the transmitted messages, which are
finally digitally solved for the original messages. We show that this is much more efficient than using the receive
antennas to separate the transmitted codewords and directly decoding each individual codeword. Our analysis uses
nested lattice codes originally developed to approach the capacity of point-to-point AWGN and dirty-paper channels
[7]-[10] and for which practical implementations were presented in [11] and subsequent works.

Next, we generalize the MIMO channel model to include interference [12], [13] and show that the integer-forcing
receiver architecture is an attractive approach to the problem of oblivious interference mitigation. By
selecting equation coefficients in a direction that depends on both the interference space and the channel matrix,
the proposed architecture reduces the impact of interference and attains a non-trivial gain over traditional linear
receivers. Furthermore, we show that the integer-forcing receiver achieves the same generalized degrees of freedom
as the joint decoder. Our proof uses techniques from Diophantine approximations, which have also recently been
used for interference alignment over fixed channels and the characterization of the degrees of freedom for
compute-and-forward [14], [15].

In the remainder of the paper, we start with a formal problem statement in Section II and then overview the
basic existing MIMO receiver architectures and their achievable rates in Section III. In Section IV, we present the
integer-forcing receiver architecture and a basic performance analysis. We show that the rate difference between
the proposed receiver and traditional linear receivers can be arbitrarily large in Section V. We study the outage
performance of the integer-forcing linear receiver under a slow fading channel model in Section VI. We show
that in the case where each antenna encodes an independent data stream, our architecture achieves the same
diversity-multiplexing tradeoff as that of the optimal joint decoder. In Section VII, we consider the MIMO channel
with interference and show that the integer-forcing receiver can be used to effectively mitigate interference. We
characterize the generalized degrees-of-freedom for the integer-forcing receiver and find that it is the same as for
the joint decoder.

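Throughout the paper, we will use boldface lowercase to refer to vectors, $\mathbf{a} \in \mathbb{R}^{M}$, and boldface uppercase to
refer to matrices, $\mathbf{A} \in \mathbb{R}^{M \times N}$. Let $\mathbf{A}^T$ denote the transpose of a matrix $\mathbf{A}$ and $|\mathbf{A}|$ denote the determinant. Also,
let $\mathbf{A}^{-1}$ denote the inverse of $\mathbf{A}$ and $\mathbf{A}^{\dagger} = (\mathbf{A}^T\mathbf{A})^{-1}\mathbf{A}^T$ denote the pseudoinverse. The notation $\|\mathbf{a}\| = \sqrt{\mathbf{a}^T\mathbf{a}}$
will refer to the $\ell_2$-norm of the vector $\mathbf{a}$ while $\|\mathbf{a}\|_\infty = \max_j |a_j|$ will refer to the $\ell_\infty$-norm. Finally, we will use
$\lambda_{\max}(\mathbf{A})$ and $\lambda_{\min}(\mathbf{A})$ to refer to the maximum and minimum singular values of the matrix $\mathbf{A}$.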

II. Problem Statement 

The baseband representation of a MIMO channel usually takes values over the complex field. For notational 
convenience, we will work with the real-valued decomposition of these complex matrices. Recall that any equation 
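of the form $\mathbf{Y} = \mathbf{G}\mathbf{X} + \mathbf{Z}$ over the complex field can be represented by its real-valued representation,

$$\begin{bmatrix} \mathrm{Re}(\mathbf{Y}) \\ \mathrm{Im}(\mathbf{Y}) \end{bmatrix} = \begin{bmatrix} \mathrm{Re}(\mathbf{G}) & -\mathrm{Im}(\mathbf{G}) \\ \mathrm{Im}(\mathbf{G}) & \mathrm{Re}(\mathbf{G}) \end{bmatrix} \begin{bmatrix} \mathrm{Re}(\mathbf{X}) \\ \mathrm{Im}(\mathbf{X}) \end{bmatrix} + \begin{bmatrix} \mathrm{Re}(\mathbf{Z}) \\ \mathrm{Im}(\mathbf{Z}) \end{bmatrix}. \tag{3}$$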

We will henceforth refer to the 2N x 2M real-valued decomposition of the channel matrix as $\mathbf{H}$. We will use 2M
independent encoders and 2M independent decoders for the resulting real-valued transmit and receive antennas.$^{1}$
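
The real-valued decomposition in (3) can be sketched in a few lines. This is a minimal illustration in pure Python, with matrices stored as nested lists; the function names `real_decomposition`, `stack_real_imag` and `matmul` are assumptions made for this example only.

```python
def real_decomposition(G):
    """Map a complex matrix G to [[Re G, -Im G], [Im G, Re G]]."""
    top = [[z.real for z in row] + [-z.imag for z in row] for row in G]
    bottom = [[z.imag for z in row] + [z.real for z in row] for row in G]
    return top + bottom


def stack_real_imag(X):
    """Stack the real part of X on top of its imaginary part."""
    return [[z.real for z in row] for row in X] + [[z.imag for z in row] for row in X]


def matmul(A, B):
    """Plain matrix product of two nested-list matrices."""
    return [[sum(a * b for a, b in zip(row, col)) for col in zip(*B)] for row in A]


# One receive antenna, two transmit antennas, one channel use (noise omitted).
G = [[1 + 2j, 3 - 1j]]
X = [[2 - 1j], [1j]]

H = real_decomposition(G)                      # 2N x 2M = 2 x 4 real matrix
lhs = matmul(H, stack_real_imag(X))            # real-valued channel output
rhs = stack_real_imag(matmul(G, X))            # complex output, then stacked
print(lhs, rhs)
```

Both routes give the same stacked output, which is the sense in which the real-valued model is equivalent to the complex one.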

Definition 1 (Messages): Each of the 2M transmit antennas has a length k data stream (or message) $\mathbf{w}_m$ drawn
independently and uniformly from $\mathcal{W} = \{0, 1, 2, \ldots, q-1\}^k$.



Definition 2 (Encoders): Each data stream $\mathbf{w}_m$ is mapped onto a length n channel input $\mathbf{x}_m \in \mathbb{R}^n$ by an
encoder,

$$\mathcal{E}_m : \mathcal{W} \to \mathbb{R}^n.$$

An equal power allocation is assumed across transmit antennas,

$$\frac{1}{n}\|\mathbf{x}_m\|^2 \le \mathsf{SNR}.$$

While we formally impose a separate power constraint on each antenna, we note that the performance at high SNR
(in terms of the diversity-multiplexing tradeoff) remains unchanged if this is replaced by a sum power constraint
over all antennas instead.

Definition 3 (Rate): Each of the 2M encoders transmits at the same rate

$$R_{\mathrm{TX}} = \frac{k}{n}\log_2 q.$$

The total rate of the MIMO system is just the number of transmit antennas times the rate, $2M R_{\mathrm{TX}}$.

Remark 1: Since the transmitters do not have knowledge of the channel matrix, we focus on the case where the
2M data streams are transmitted at equal rates. We will compare the integer-forcing receiver against successive
cancellation V-BLAST schemes with asymmetric rates in Section VI-B.

Definition 4 (Channel): Let $\mathbf{X} \in \mathbb{R}^{2M \times n}$ be the matrix of transmitted vectors,

$$\mathbf{X} = [\mathbf{x}_1 \ \cdots \ \mathbf{x}_{2M}]^T. \tag{4}$$

The MIMO channel takes $\mathbf{X}$ as an input, multiplies it by the channel matrix $\mathbf{H} \in \mathbb{R}^{2N \times 2M}$ and adds noise
$\mathbf{Z} \in \mathbb{R}^{2N \times n}$ whose entries are i.i.d. Gaussian with zero mean and unit variance. The signal $\mathbf{Y} \in \mathbb{R}^{2N \times n}$ observed
across the 2N receive antennas over n channel uses can be written as

$$\mathbf{Y} = \mathbf{H}\mathbf{X} + \mathbf{Z}. \tag{5}$$

We assume that the channel realization $\mathbf{H}$ is known to the receiver but unknown to the transmitter and remains
constant throughout the transmission block of length n.

$^{1}$ The implementation complexity of our scheme can be decreased slightly by specializing it to the complex field using the techniques in
[6]. For notational convenience, we focus solely on the real-valued representation, and do not exploit the constraints on the matrix $\mathbf{H}$.

Definition 5 (Decoder): At the receiver, a decoder makes an estimate of the messages,

$$(\hat{\mathbf{w}}_1, \ldots, \hat{\mathbf{w}}_{2M}) = \mathcal{D}(\mathbf{Y}). \tag{7}$$

Definition 6 (Achievable Rates): We say that sum rate $R(\mathbf{H})$ is achievable if for any $\epsilon > 0$ and n large enough,
there exist encoders and a decoder such that reliable decoding is possible,

$$\Pr\big((\hat{\mathbf{w}}_1, \ldots, \hat{\mathbf{w}}_{2M}) \ne (\mathbf{w}_1, \ldots, \mathbf{w}_{2M})\big) < \epsilon,$$

so long as the total rate does not exceed $R(\mathbf{H})$,

$$2M R_{\mathrm{TX}} < R(\mathbf{H}).$$

III. Existing Receiver Architectures 

Many approaches to MIMO decoding have been studied in the literature. We provide a brief summary of some of
the major receiver architectures and the associated achievable rates, including the joint ML receiver, the decorrelator,
the linear MMSE estimator and the MMSE-SIC estimator.



A. Joint ML Receivers 

Clearly, the best performance is attainable by joint ML decoding across all N receive antennas. This situation
is illustrated in Figure 1. Let $\mathbf{H}_{\mathcal{S}}$ denote the submatrix of $\mathbf{H}$ formed by taking the columns with indices in
$\mathcal{S} \subseteq \{1, 2, \ldots, 2M\}$. If we use a joint ML decoder that searches for the most likely set of transmitted message
vectors $\mathbf{w}_1, \ldots, \mathbf{w}_{2M}$, then the following rate is achievable (using Gaussian codebooks at the transmitter):

$$R_{\mathrm{JOINT}}(\mathbf{H}) = \min_{\mathcal{S} \subseteq \{1, 2, \ldots, 2M\}} \frac{M}{|\mathcal{S}|} \log\det\left(\mathbf{I}_{\mathcal{S}} + \mathsf{SNR}\, \mathbf{H}_{\mathcal{S}}\mathbf{H}_{\mathcal{S}}^T\right) \tag{8}$$

where $\mathbf{I}_{\mathcal{S}}$ is the identity matrix.$^{2}$ Note that this is also the capacity of the channel subject to equal rate constraints
per transmit antenna. The worst-case complexity of this approach is exponential in the product of the blocklength
n and the number of antennas N.

One approach to reduce the complexity of the joint ML decoder is to employ a sphere decoder. Rather than
naively checking all possible codewords, the sphere decoder only examines codewords that lie within a ball around
the received vector. If the radius of the ball is suitably chosen, this search is guaranteed to return the ML candidate
vector. We refer interested readers to [16]-[20] for more details on sphere decoding algorithms as well as to [21]
for a recent hardware implementation.



Fig. 1. MIMO channel with single stream encoding.

$^{2}$ With joint encoding and decoding, a rate of $\frac{1}{2}\log\det\left(\mathbf{I} + \mathsf{SNR}\,\mathbf{H}\mathbf{H}^T\right)$ is achievable.
B. Traditional Linear Receivers

Rather than processing all the observed signals from the antennas jointly, one simple approach is to separate out
the transmitted data streams using a linear projection and then decode each data stream individually, as shown in
Figure 2. Given the observed matrix $\mathbf{Y} = \mathbf{H}\mathbf{X} + \mathbf{Z}$ from (5), the receiver forms the projection

$$\begin{aligned} \tilde{\mathbf{Y}} &= \mathbf{B}\mathbf{Y} && (9) \\ &= \mathbf{B}\mathbf{H}\mathbf{X} + \mathbf{B}\mathbf{Z} && (10) \end{aligned}$$

where $\mathbf{B} \in \mathbb{R}^{2M \times 2N}$. Each row $\tilde{\mathbf{y}}_m^T$ of $\tilde{\mathbf{Y}}$ is treated as a noisy version of $\mathbf{x}_m^T$. In traditional linear receivers, the goal
of the projection matrix $\mathbf{B}$ is to separate the incoming data streams. For the decorrelator architecture, we choose the
projection to be the pseudoinverse of the channel matrix, $\mathbf{B} = (\mathbf{H}^T\mathbf{H})^{-1}\mathbf{H}^T$. In the case where $N \ge M$, the resulting
channel is interference free. If $\mathbf{H}$ is orthogonal, then the decorrelator architecture can match the performance of
a joint ML decoder. As the condition number of $\mathbf{H}$ increases, the performance gap between the decorrelator and
the joint decoder increases due to noise amplification (see the example in Section IV-B). The performance of the
decorrelator can be improved at low SNR using the MMSE architecture, which sets $\mathbf{B} = \mathbf{H}^T\left(\mathbf{H}\mathbf{H}^T + \frac{1}{\mathsf{SNR}}\mathbf{I}\right)^{-1}$.
Let $\mathbf{b}_m^T$ be the $m$-th row vector of $\mathbf{B}$ and $\mathbf{h}_m$ the $m$-th column vector of $\mathbf{H}$. The following rate is achievable for the
$m$-th data stream using a decorrelator architecture with Gaussian codebooks:

$$R_m(\mathbf{H}) = \frac{1}{2}\log\left(1 + \frac{\mathsf{SNR}\,\|\mathbf{b}_m^T\mathbf{h}_m\|^2}{\|\mathbf{b}_m\|^2 + \mathsf{SNR}\sum_{i \ne m}\|\mathbf{b}_m^T\mathbf{h}_i\|^2}\right). \tag{11}$$

Since we focus on the case where each data stream is encoded at the same rate, the achievable sum rate is dictated
by the worst stream,

$$R_{\mathrm{LINEAR}}(\mathbf{H}) = \min_m 2M R_m(\mathbf{H}). \tag{12}$$
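
To make (11) and (12) concrete, the sketch below evaluates the per-stream rate and the resulting sum rate for the 2 x 2 example channel from (1) with the decorrelator front-end from (2). It is an illustration under stated assumptions: logarithms are taken base 2 (rates in bits), the SNR value of 100 is arbitrary, and the function names are hypothetical.

```python
import math


def stream_rate(b, H, m, snr):
    """Per-stream rate of a linear receiver: signal power over noise
    plus residual interference, for projection row b and stream m."""
    columns = list(zip(*H))  # columns h_i of H
    gains = [sum(bi * hi for bi, hi in zip(b, col)) ** 2 for col in columns]
    noise = sum(x * x for x in b) + snr * (sum(gains) - gains[m])
    return 0.5 * math.log2(1.0 + snr * gains[m] / noise)


def linear_sum_rate(B, H, snr):
    """Sum rate: number of streams times the rate of the worst stream."""
    n_streams = len(B)
    return n_streams * min(stream_rate(B[m], H, m, snr) for m in range(n_streams))


H = [[2, 1], [1, 1]]
B_decorrelator = [[1, -1], [-1, 2]]   # inverse of H
snr = 100.0

rates = [stream_rate(B_decorrelator[m], H, m, snr) for m in range(2)]
sum_rate = linear_sum_rate(B_decorrelator, H, snr)
print(rates, sum_rate)
```

The second stream, whose projection row has squared norm 5, limits the sum rate, which illustrates why noise amplification is the bottleneck for the decorrelator.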

The complexity of a linear receiver architecture is dictated primarily by the choice of decoding algorithm for
the individual data streams. In the worst case (when ML decoding is used for each data stream), the complexity
is exponential in the blocklength of the data stream. In practice, one can employ low-density parity-check (LDPC)
codes to approach rates close to the capacity with linear complexity [22].

The performance of this class of linear receivers can be improved using successive interference cancellation
(SIC) [23], [24]. After a codeword is decoded, it may be subtracted from the observed vector prior to decoding
the next codeword, which increases the effective signal-to-noise ratio. Let $\Pi$ denote the set of all permutations of
$\{1, 2, \ldots, 2M\}$. For a fixed decoding order $\pi \in \Pi$, let $\pi_m = \{\pi(m), \pi(m+1), \ldots, \pi(2M)\}$ denote the indices of
the data streams that have not yet been decoded. Let $\mathbf{h}_{\pi(m)}$ denote the $\pi(m)$-th column vector of $\mathbf{H}$ and let $\mathbf{H}_{\pi_m}$
be the submatrix consisting of the columns with indices $\pi_m$, i.e., $\mathbf{H}_{\pi_m} = [\mathbf{h}_{\pi(m)} \ \cdots \ \mathbf{h}_{\pi(2M)}]$. The following rate is
achievable in the $\pi(m)$-th stream using successive interference cancellation:

$$R_{\pi(m)}(\mathbf{H}) = \frac{1}{2}\log\left(1 + \frac{\mathsf{SNR}\,\|\mathbf{b}_m^T\mathbf{h}_{\pi(m)}\|^2}{\|\mathbf{b}_m\|^2 + \mathsf{SNR}\sum_{i > m}\|\mathbf{b}_m^T\mathbf{h}_{\pi(i)}\|^2}\right) \tag{13}$$

where $\mathbf{b}_m = \left(\mathbf{H}_{\pi_m}\mathbf{H}_{\pi_m}^T + \frac{1}{\mathsf{SNR}}\mathbf{I}\right)^{-1}\mathbf{h}_{\pi(m)}$ is the projection vector to decode the $\pi(m)$-th stream after canceling the
interference from the $\pi(1), \ldots, \pi(m-1)$-th streams. For a fixed decoding order $\pi$, the achievable sum-rate is given
by

$$R_{\mathrm{SIC},1}(\mathbf{H}) = \min_m 2M R_{\pi(m)}(\mathbf{H}). \tag{14}$$

The above scheme is referred to as V-BLAST I (see [25] for more details). An improvement can be attained by
selecting the decoding order, and thus the permutation.

In the case of V-BLAST II, the sum rate is improved by choosing the decoding order that maximizes the rate of the
worst stream,

$$R_{\mathrm{SIC},2}(\mathbf{H}) = \max_{\pi \in \Pi}\min_m 2M R_{\pi(m)}(\mathbf{H}). \tag{15}$$

Hence, V-BLAST I can never outperform V-BLAST II, whatever the channel parameters. We postpone the discussion of
V-BLAST III to Section VI, where we introduce the outage formulation.

Using ML decoding for each individual data stream, the complexity of the MMSE-SIC architecture is again
exponential in blocklength. However, unlike the decorrelator and linear MMSE receiver, not all M streams can be
decoded in parallel and delay is incurred as later streams have to wait for earlier streams to finish decoding.




Fig. 2. A traditional linear receiver. Each of the individual message vectors is decoded directly from the projected channel output. The
goal of the linear projection is to approximately invert the channel and cancel the interference from other streams.



C. Lattice-Reduction Detectors 

Another class of linear architectures comes under the name of lattice-reduction detectors. It has been shown
that lattice reduction can be used to improve the performance of the decorrelator when the channel matrix is near
singular [26] and can achieve the receive diversity [27]. Lattice-reduction detectors are symbol-level linear receivers
that impose a linear constellation constraint, e.g., a QAM constellation, on the transmitters. The output of the MIMO
channel $\mathbf{Y}$ is passed through a linear filter $\mathbf{B}$ to get the resulting output:

$$\begin{aligned} \tilde{\mathbf{Y}} &= \mathbf{B}\mathbf{Y} && (16) \\ &= \mathbf{B}\mathbf{H}\mathbf{X} + \mathbf{B}\mathbf{Z} && (17) \\ &= \mathbf{A}\mathbf{X} + \mathbf{B}\mathbf{Z} && (18) \end{aligned}$$

where $\mathbf{A} = \mathbf{B}\mathbf{H}$ is the effective channel matrix. In lattice reduction, the effective channel matrix is restricted to be
unimodular: both its entries and the entries of its inverse must be integers. Let $\mathbf{a}_1^T, \ldots, \mathbf{a}_{2M}^T$ be the row vectors of
matrix $\mathbf{A}$. The lattice-reduction detector produces estimates of the symbols of $\mathbf{a}_m^T\mathbf{X}$ from $\tilde{\mathbf{Y}}$. There are two key
differences between the proposed integer-forcing receiver and the lattice-reduction receiver. First, the integer-forcing
receiver operates on the codeword level rather than on the symbol level. Second, the effective channel matrix $\mathbf{A}$ of
the integer-forcing receiver is not restricted to be unimodular: it can be any full-rank integer matrix. We compare
lattice reduction to the integer-forcing receiver in Example 3 by restricting the effective matrix to be unimodular under the
integer-forcing architecture. We show that this restriction can result in an arbitrarily large performance gap.

Two other works have developed lattice architectures for joint decoding that can achieve the optimal
diversity-multiplexing tradeoff [28], [29].

IV. Proposed Receiver Architecture 

A. Architecture Overview 

Linear receivers such as the decorrelator and the MMSE receiver directly decode the data streams after the
projection step. In other words, they use the linear projection matrix $\mathbf{B}$ to invert the channel matrix at the cost of
amplifying the noise. Although low in complexity, these approaches are far from optimal when the channel matrix
is ill-conditioned. In the integer-forcing architecture, each encoder uses the same linear code and the receiver
exploits the code-level linearity to recover equations of the transmitted messages. Instead of inverting the channel,
the scheme uses $\mathbf{B}$ to force the effective channel to a full-rank integer matrix $\mathbf{A}$. As in the case of traditional linear
receivers, each element of the effective output is then sent to a separate decoder. However, since each encoder uses
the same linear code, each decoder can recover an integer linear combination of the codewords. The integer-forcing
receiver is free to choose the set of equation coefficients $\mathbf{A}$ to be any full-rank integer matrix. The resulting integer
combinations of codewords can be mapped back to a full-rank set of equations of the messages over a finite field.$^{3}$ Finally, the
individual message vectors are recovered from the set of full-rank equations of message vectors. The details of
the architecture are provided in the sequel and an illustration is given in Figure 3.



Fig. 3. The proposed integer-forcing linear receiver. Each decoder first recovers an equation of the transmitted message vectors. These
equations are then collected and solved for the individual message vectors. The goal of the linear projection is to create a full-rank,
integer-valued effective channel matrix.

Prior to decoding, our receiver projects the channel output using the 2M x 2N matrix $\mathbf{B}$ to get the effective
channel

$$\begin{aligned} \tilde{\mathbf{Y}} &= \mathbf{B}\mathbf{Y} && (19) \\ &= \mathbf{B}\mathbf{H}\mathbf{X} + \mathbf{B}\mathbf{Z}. && (20) \end{aligned}$$

Each preprocessed output $\tilde{\mathbf{y}}_m$ is then passed into a separate decoder $\mathcal{D}_m : \mathbb{R}^n \to \mathcal{W}$. Decoder m attempts to recover
a linear equation of the message vectors

$$\mathbf{u}_m = \left[\sum_{\ell=1}^{2M} a_{m\ell}\mathbf{w}_\ell\right] \bmod q \tag{21}$$

for some $a_{m\ell} \in \mathbb{Z}$. Let $\mathbf{a}_m$ denote the vector of desired coefficients for decoder m, $\mathbf{a}_m = [a_{m1} \ a_{m2} \ \cdots \ a_{m2M}]^T$.
We choose $\mathbf{a}_1, \ldots, \mathbf{a}_{2M}$ to be linearly independent.$^{4}$ Decoder m outputs an estimate $\hat{\mathbf{u}}_m$ for the equation $\mathbf{u}_m$. We
will design our scheme such that, for any $\epsilon > 0$ and n large enough, the desired linear equations are recovered
with probability of error satisfying

$$\Pr\big((\hat{\mathbf{u}}_1, \ldots, \hat{\mathbf{u}}_{2M}) \ne (\mathbf{u}_1, \ldots, \mathbf{u}_{2M})\big) < \epsilon. \tag{22}$$

Let $\mathbf{W} = [\mathbf{w}_1 \ \cdots \ \mathbf{w}_{2M}]^T$ denote the matrix of message vectors, $\mathbf{U} = [\mathbf{u}_1 \ \cdots \ \mathbf{u}_{2M}]^T$ denote the matrix of
linear equations of message vectors and $\mathbf{A} = [\mathbf{a}_1 \ \cdots \ \mathbf{a}_{2M}]^T$ denote the integer matrix of equation coefficients.
Since $\mathbf{A}$ is full-rank, the original message vectors can be recovered from the set of linear equations by a simple
inverse operation:

$$\mathbf{W} = \mathbf{A}^{-1}\mathbf{U}. \tag{23}$$
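
The final "digital" step, solving the recovered equations for the messages, can be sketched as Gauss-Jordan elimination modulo a prime q. This is a minimal illustration, not the authors' implementation: the function name `solve_mod_q` and the toy values (q = 5, length-3 messages) are assumptions, and the sketch simply raises an error if the coefficient matrix happens not to be invertible modulo q.

```python
def solve_mod_q(A, U, q):
    """Solve A W = U (mod q) for W, with q prime, by Gauss-Jordan elimination."""
    n = len(A)
    # Augmented matrix [A | U], reduced modulo q.
    aug = [[x % q for x in A[i]] + [x % q for x in U[i]] for i in range(n)]
    for col in range(n):
        pivot = next((r for r in range(col, n) if aug[r][col] != 0), None)
        if pivot is None:
            raise ValueError("A is not invertible modulo q")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        inv = pow(aug[col][col], -1, q)  # modular inverse (Python 3.8+)
        aug[col] = [(x * inv) % q for x in aug[col]]
        for r in range(n):
            if r != col and aug[r][col] != 0:
                factor = aug[r][col]
                aug[r] = [(x - factor * y) % q for x, y in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


q = 5
A = [[2, 1], [1, 1]]                 # equation coefficients, as in the 2 x 2 example
W = [[3, 0, 4], [1, 2, 2]]           # two toy message vectors of length k = 3
# Equations the decoders would output: u_m = sum_l a_ml * w_l mod q.
U = [[sum(A[m][l] * W[l][j] for l in range(2)) % q for j in range(3)] for m in range(2)]
recovered = solve_mod_q(A, U, q)
print(U, recovered)
```

Here the two equations are 2w_1 + w_2 and w_1 + w_2 modulo 5, and elimination returns the original message vectors.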

In the following subsections, we will provide details on the achievable rate, the choice of the coefficients of the 
integer matrix A, and the complexity of our architecture. 

$^{3}$ For the scope of the present paper, we assume that q is prime to ensure invertibility. However, this restriction may be removed as shown
in [30].

$^{4}$ It is sufficient to consider matrices $\mathbf{B}$ and desired coefficient vectors $\mathbf{a}_m$ that are real-valued decompositions of a complex matrix or
vector.
B. Achievable Rates

We use the compute-and-forward framework developed in [6] to derive the achievable rate of the integer-forcing
linear receiver. Let $\mathbf{h}_m^T$ be the $m$-th row vector of $\mathbf{H}$. In the case where $\mathbf{B} = \mathbf{I}$, the channel output to the $m$-th decoder
is given by

$$\mathbf{y}_m^T = \mathbf{h}_m^T\mathbf{X} + \mathbf{z}_m^T \tag{24}$$

and the rate at which the set of equations $\mathbf{u}_1, \ldots, \mathbf{u}_{2M}$ can be reliably recovered is given in the following theorem.
Define $\log^+(x) = \max\{\log(x), 0\}$.

Theorem 1 ([6, Theorem 1]): For any $\epsilon > 0$ and n large enough, there exist fixed encoders and decoders,
$\mathcal{E}_1, \ldots, \mathcal{E}_{2M}, \mathcal{D}_1, \ldots, \mathcal{D}_{2M}$, such that all decoders can recover their equations with total probability of error at most
$\epsilon$ so long as

$$R_{\mathrm{TX}} < \min_{m=1,\ldots,2M} R(\mathbf{H}, \mathbf{a}_m) \tag{25}$$

$$R(\mathbf{H}, \mathbf{a}_m) = \frac{1}{2}\log^+\left(\frac{\mathsf{SNR}}{1 + \mathsf{SNR}\,\|\mathbf{h}_m - \mathbf{a}_m\|^2}\right) \tag{26}$$

for the selected equation coefficients $\mathbf{a}_1, \ldots, \mathbf{a}_{2M} \in \mathbb{Z}^{2M}$.

Remark 2: Note that the decoders in Theorem 1 are free to choose any equation coefficients that satisfy (25).
The encoders are completely oblivious to the choice of coefficients.

It is instructive to examine the noise term $1 + \mathsf{SNR}\,\|\mathbf{h}_m - \mathbf{a}_m\|^2$. The leading 1 corresponds to the additive noise,
which has unit variance in our model. The more interesting term $\|\mathbf{h}_m - \mathbf{a}_m\|^2$ corresponds to the "non-integer"
penalty since the channel coefficients $\mathbf{h}_m$ are not exactly matched to the coefficients $\mathbf{a}_m$ of the linear equation.

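As illustrated in Figure 3, we first multiply the channel output matrix $\mathbf{Y}$ by a judiciously chosen matrix $\mathbf{B}$. That
is, the effective channel output observed by the $m$-th decoder can be expressed as

$$\begin{aligned} \tilde{\mathbf{y}}_m^T &= \sum_{i=1}^{2M} (\mathbf{b}_m^T\mathbf{h}_i)\,\mathbf{x}_i^T + \mathbf{b}_m^T\mathbf{Z} && (27) \\ &= \tilde{\mathbf{h}}_m^T\mathbf{X} + \tilde{\mathbf{z}}_m^T && (28) \end{aligned}$$

where $\mathbf{h}_i$ in (27) is the $i$-th column of $\mathbf{H}$, $\tilde{\mathbf{h}}_m = \mathbf{H}^T\mathbf{b}_m$ is the effective channel to the $m$-th decoder and $\tilde{\mathbf{z}}_m$ is the effective noise with variance $\|\mathbf{b}_m\|^2$.
The achievable rate of the integer-forcing linear receiver is given in the next theorem.

Theorem 2: Consider the MIMO channel with channel matrix $\mathbf{H} \in \mathbb{R}^{2N \times 2M}$. Under the integer-forcing
architecture, the following rate is achievable:

$$R < \min_m 2M R(\mathbf{H}, \mathbf{a}_m, \mathbf{b}_m) \tag{29}$$

$$R(\mathbf{H}, \mathbf{a}_m, \mathbf{b}_m) = \frac{1}{2}\log^+\left(\frac{\mathsf{SNR}}{\|\mathbf{b}_m\|^2 + \mathsf{SNR}\,\|\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m\|^2}\right)$$

for any full-rank integer matrix $\mathbf{A} \in \mathbb{Z}^{2M \times 2M}$ and any matrix $\mathbf{B} \in \mathbb{R}^{2M \times 2N}$.

Proof: Applying Theorem 1 with effective channel $\tilde{\mathbf{h}}_m = \mathbf{H}^T\mathbf{b}_m$ and effective noise variance $\|\mathbf{b}_m\|^2$,
it follows that the receiver can reliably recover the set of linear equations $\mathbf{u}_1, \ldots, \mathbf{u}_{2M}$ where

$$\mathbf{u}_m = \left[\sum_{\ell=1}^{2M} a_{m\ell}\mathbf{w}_\ell\right] \bmod q. \tag{30}$$

The message vectors $\mathbf{w}_1, \ldots, \mathbf{w}_{2M}$ can be solved in turn by inverting the linear equations, $\mathbf{W} = \mathbf{A}^{-1}\mathbf{U}$. ■

Theorem 2 provides an achievable rate for the integer-forcing architecture for any preprocessing matrix $\mathbf{B}$ and
any full-rank integer matrix $\mathbf{A}$. The remaining task is to select these matrices in such a way as to maximize the rate
expression given in Theorem 2. This turns out to be a non-trivial task. We consider it in two steps. In particular,
we first observe that for a fixed integer matrix $\mathbf{A}$, it is straightforward to characterize the optimal preprocessing
matrix $\mathbf{B}$. Then, in the next subsection, we discuss the problem of selecting the integer matrix $\mathbf{A}$.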
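
The rate expression of Theorem 2 is easy to evaluate numerically. The sketch below is a hedged illustration: it assumes base-2 logarithms, an arbitrary SNR of 100, and hypothetical function names. It compares two choices for the 2 x 2 example channel from (1): inverting the channel (A = I, B = H^{-1}) and leaving the channel untouched (B = I, A = H), which is possible here because H already has integer entries.

```python
import math


def log2_plus(x):
    """log^+ with base-2 logarithm: max(log2(x), 0)."""
    return max(math.log2(x), 0.0)


def integer_forcing_sum_rate(H, A, B, snr):
    """Sum rate of Theorem 2. Rows of A and B are a_m^T and b_m^T."""
    H_transposed = list(zip(*H))
    rates = []
    for a, b in zip(A, B):
        # Effective channel H^T b_m seen by decoder m.
        effective = [sum(h * bi for h, bi in zip(row, b)) for row in H_transposed]
        mismatch = sum((e - ai) ** 2 for e, ai in zip(effective, a))
        noise = sum(x * x for x in b) + snr * mismatch
        rates.append(0.5 * log2_plus(snr / noise))
    return len(A) * min(rates)


H = [[2, 1], [1, 1]]
identity = [[1, 0], [0, 1]]
H_inverse = [[1, -1], [-1, 2]]
snr = 100.0

rate_channel_inversion = integer_forcing_sum_rate(H, identity, H_inverse, snr)
rate_integer_forcing = integer_forcing_sum_rate(H, H, identity, snr)
print(rate_channel_inversion, rate_integer_forcing)
```

With B = I the noise variance stays at 1 and there is no non-integer penalty, so the sum rate is log2(100), compared with log2(20) when the channel is inverted.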
We consider the case when $N \ge M$ and note that given a fixed full-rank integer matrix $\mathbf{A}$, a simple choice for
the preprocessing matrix is

$$\mathbf{B}_{\mathrm{EXACT}} = \mathbf{A}\mathbf{H}^{\dagger} \tag{31}$$

where $\mathbf{H}^{\dagger}$ is the pseudoinverse of $\mathbf{H}$. We call this scheme "exact" integer-forcing since the effective channel
matrix after preprocessing is simply the full-rank integer matrix $\mathbf{A}$. We also note that choosing $\mathbf{B}_{\mathrm{EXACT}}$ and $\mathbf{A} = \mathbf{I}$
corresponds to the decorrelator. More generally, the performance of exact integer-forcing is summarized in the
following corollary.

Corollary 1: Consider the case where $N \ge M$. The achievable rate from Theorem 2 can be written equivalently
as

$$R < \min_m 2M R(\mathbf{H}, \mathbf{a}_m) \tag{32}$$

$$R(\mathbf{H}, \mathbf{a}_m) = \frac{1}{2}\log\left(\frac{\mathsf{SNR}}{\|(\mathbf{H}^{\dagger})^T\mathbf{a}_m\|^2}\right) \tag{33}$$

for any full-rank integer matrix $\mathbf{A}$ by setting $\mathbf{B} = \mathbf{A}\mathbf{H}^{\dagger}$.

We call the expression in the denominator of (33) the "effective noise variance." The achievable rate in (32) is
determined by the largest effective noise variance,

$$\sigma^2_{\mathrm{EFFECTIVE}} = \max_m \|(\mathbf{H}^{\dagger})^T\mathbf{a}_m\|^2. \tag{34}$$

Hence, the goal is to choose linearly independent equations $\mathbf{a}_1, \mathbf{a}_2, \ldots, \mathbf{a}_{2M}$ to minimize the expression $\sigma^2_{\mathrm{EFFECTIVE}}$
in (34). The integer-forcing receiver provides the freedom to choose any full-rank integer matrix $\mathbf{A}$. In the remainder
of this section, we characterize the optimal linear projection matrix $\mathbf{B}$ for a fixed coefficient matrix $\mathbf{A}$ and provide
an equivalent rate expression for Theorem 2. We will then use this expression in Section IV-C to provide insight
into how to select the optimal coefficient matrix $\mathbf{A}$.

Corollary 2: The optimal linear projection matrix for a fixed coefficient matrix $\mathbf{A}$ is given by

$$\mathbf{B}_{\mathrm{OPT}} = \mathbf{A}\mathbf{H}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)^{-1}. \tag{35}$$

Remark 3: The linear MMSE estimator, given by $\mathbf{B}_{\mathrm{MMSE}} = \mathbf{H}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)^{-1}$, is a special case of the
integer-forcing receiver with $\mathbf{B}_{\mathrm{OPT}}$ and $\mathbf{A} = \mathbf{I}$.

Remark 4: $\lim_{\mathsf{SNR} \to \infty} \mathbf{B}_{\mathrm{OPT}} = \mathbf{B}_{\mathrm{EXACT}}$. Hence, under a fixed channel matrix, exact integer-forcing is optimal as
$\mathsf{SNR} \to \infty$.

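Proof of Corollary 2: Let $\mathbf{B} = [\mathbf{b}_1 \ \cdots \ \mathbf{b}_{2M}]^T$. We solve for each $\mathbf{b}_m$ separately to maximize the achievable
rate in Theorem 2,

$$\begin{aligned} \mathbf{b}_m &= \arg\max_{\mathbf{b}_m} \frac{1}{2}\log\left(\frac{\mathsf{SNR}}{\|\mathbf{b}_m\|^2 + \mathsf{SNR}\,\|\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m\|^2}\right) \\ &= \arg\min_{\mathbf{b}_m} \frac{1}{\mathsf{SNR}}\|\mathbf{b}_m\|^2 + \|\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m\|^2. && (36) \end{aligned}$$

Define this quantity to be the function $f(\mathbf{b}_m)$ and rewrite as follows:

$$\begin{aligned} f(\mathbf{b}_m) &= \frac{1}{\mathsf{SNR}}\|\mathbf{b}_m\|^2 + \|\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m\|^2 && (37) \\ &= \frac{1}{\mathsf{SNR}}\mathbf{b}_m^T\mathbf{b}_m + (\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m)^T(\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m) && (38) \\ &= \frac{1}{\mathsf{SNR}}\mathbf{b}_m^T\mathbf{b}_m + \mathbf{b}_m^T\mathbf{H}\mathbf{H}^T\mathbf{b}_m - 2\mathbf{b}_m^T\mathbf{H}\mathbf{a}_m + \mathbf{a}_m^T\mathbf{a}_m && (39) \\ &= \mathbf{b}_m^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)\mathbf{b}_m - 2\mathbf{b}_m^T\mathbf{H}\mathbf{a}_m + \mathbf{a}_m^T\mathbf{a}_m. && (40) \end{aligned}$$

Taking the derivative of $f$ with respect to $\mathbf{b}_m$, we have that

$$\frac{df(\mathbf{b}_m)}{d\mathbf{b}_m} = 2\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)\mathbf{b}_m - 2\mathbf{H}\mathbf{a}_m. \tag{42}$$

Setting $\frac{df(\mathbf{b}_m)}{d\mathbf{b}_m} = 0$ and solving for $\mathbf{b}_m$, we have that

$$\mathbf{b}_m^T = \mathbf{a}_m^T\mathbf{H}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)^{-1}. \tag{43}$$

Corollary 2 follows since $\mathbf{B} = [\mathbf{b}_1 \ \cdots \ \mathbf{b}_{2M}]^T$. ■

Using the optimal linear projection matrix from Corollary 2, we derive an alternative expression for the achievable
rate in Theorem 2.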

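Theorem 3: The achievable rate from Theorem 2 under the optimal projection matrix $\mathbf{B}_{\mathrm{OPT}}$ from (35) can be
expressed as

$$R < \min_m 2M R(\mathbf{H}, \mathbf{a}_m) \tag{44}$$

$$R(\mathbf{H}, \mathbf{a}_m) = -\frac{1}{2}\log \mathbf{a}_m^T\mathbf{V}\mathbf{D}\mathbf{V}^T\mathbf{a}_m, \tag{45}$$

where $\mathbf{V} \in \mathbb{R}^{2M \times 2M}$ is the matrix composed of the eigenvectors of $\mathbf{H}^T\mathbf{H}$ and $\mathbf{D} \in \mathbb{R}^{2M \times 2M}$ is a diagonal matrix
with elements

$$D_{i,i} = \begin{cases} \dfrac{1}{\mathsf{SNR}\,\lambda_i^2 + 1} & i \le \mathrm{rank}(\mathbf{H}) \\ 1 & i > \mathrm{rank}(\mathbf{H}) \end{cases} \tag{46}$$

and $\lambda_i$ is the $i$-th singular value of $\mathbf{H}$.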

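Proof: Let $f$ be defined as in (37) and define $\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T$ to be the singular value decomposition (SVD) of $\mathbf{H}$ with
$\mathbf{U} \in \mathbb{R}^{2N \times 2N}$, $\mathbf{\Sigma} \in \mathbb{R}^{2N \times 2M}$, and $\mathbf{V} \in \mathbb{R}^{2M \times 2M}$. Note that in this SVD representation, $\Sigma_{i,i} = \lambda_i$ and $\Sigma_{i,j} = 0$ for
all $i \ne j$. Evaluating $f$ for the $m$-th row $\mathbf{b}_m^T$ of $\mathbf{B}_{\mathrm{OPT}}$ yields

$$\begin{aligned} f(\mathbf{b}_m) &= \frac{1}{\mathsf{SNR}}\mathbf{b}_m^T\mathbf{b}_m + \mathbf{b}_m^T\mathbf{H}\mathbf{H}^T\mathbf{b}_m - \mathbf{b}_m^T\mathbf{H}\mathbf{a}_m - \mathbf{a}_m^T\mathbf{H}^T\mathbf{b}_m + \mathbf{a}_m^T\mathbf{a}_m && (47) \\ &= \mathbf{b}_m^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)\mathbf{b}_m - \mathbf{b}_m^T\mathbf{H}\mathbf{a}_m - \mathbf{a}_m^T\mathbf{H}^T\mathbf{b}_m + \mathbf{a}_m^T\mathbf{a}_m. && (48) \end{aligned}$$

Combining (43) and (48), it follows that

$$\begin{aligned} f(\mathbf{b}_m) &= \mathbf{b}_m^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)^{-1}\mathbf{H}\mathbf{a}_m - \mathbf{b}_m^T\mathbf{H}\mathbf{a}_m - \mathbf{a}_m^T\mathbf{H}^T\mathbf{b}_m + \mathbf{a}_m^T\mathbf{a}_m && (49) \\ &= \mathbf{b}_m^T\mathbf{H}\mathbf{a}_m - \mathbf{b}_m^T\mathbf{H}\mathbf{a}_m - \mathbf{a}_m^T\mathbf{H}^T\mathbf{b}_m + \mathbf{a}_m^T\mathbf{a}_m && (50) \\ &= -\mathbf{a}_m^T\mathbf{H}^T\mathbf{b}_m + \mathbf{a}_m^T\mathbf{a}_m && (51) \\ &= -\mathbf{a}_m^T\mathbf{H}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{H}\mathbf{H}^T\right)^{-1}\mathbf{H}\mathbf{a}_m + \mathbf{a}_m^T\mathbf{a}_m && (52) \\ &= -\mathbf{a}_m^T\mathbf{V}\mathbf{\Sigma}^T\mathbf{U}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{U}\mathbf{\Sigma}\mathbf{\Sigma}^T\mathbf{U}^T\right)^{-1}\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T\mathbf{a}_m + \mathbf{a}_m^T\mathbf{I}\mathbf{a}_m. && (53) \end{aligned}$$

Since $\mathbf{U}$ is an orthonormal matrix, $\mathbf{U}^{-1} = \mathbf{U}^T$ and (53) can be rewritten as follows

$$\begin{aligned} f(\mathbf{b}_m) &= -\mathbf{a}_m^T\mathbf{V}\mathbf{\Sigma}^T\mathbf{U}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{U}\mathbf{U}^T + \mathbf{U}\mathbf{\Sigma}\mathbf{\Sigma}^T\mathbf{U}^T\right)^{-1}\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T\mathbf{a}_m + \mathbf{a}_m^T\mathbf{I}\mathbf{a}_m && (54) \\ &= -\mathbf{a}_m^T\mathbf{V}\mathbf{\Sigma}^T\mathbf{U}^T(\mathbf{U}^T)^{-1}\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{\Sigma}\mathbf{\Sigma}^T\right)^{-1}\mathbf{U}^{-1}\mathbf{U}\mathbf{\Sigma}\mathbf{V}^T\mathbf{a}_m + \mathbf{a}_m^T\mathbf{I}\mathbf{a}_m && (55) \\ &= -\mathbf{a}_m^T\mathbf{V}\mathbf{\Sigma}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{\Sigma}\mathbf{\Sigma}^T\right)^{-1}\mathbf{\Sigma}\mathbf{V}^T\mathbf{a}_m + \mathbf{a}_m^T\mathbf{I}\mathbf{a}_m. && (56) \end{aligned}$$

Since $\mathbf{V}$ is an orthonormal matrix, $\mathbf{V}^{-1} = \mathbf{V}^T$ and (56) can be rewritten as follows

$$\begin{aligned} f(\mathbf{b}_m) &= \mathbf{a}_m^T\left(-\mathbf{V}\mathbf{\Sigma}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{\Sigma}\mathbf{\Sigma}^T\right)^{-1}\mathbf{\Sigma}\mathbf{V}^T + \mathbf{V}\mathbf{V}^T\right)\mathbf{a}_m && (57) \\ &= \mathbf{a}_m^T\mathbf{V}\left(-\mathbf{\Sigma}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{\Sigma}\mathbf{\Sigma}^T\right)^{-1}\mathbf{\Sigma} + \mathbf{I}\right)\mathbf{V}^T\mathbf{a}_m && (58) \\ &= \mathbf{a}_m^T\mathbf{V}\left(\mathbf{I} - \mathbf{\Sigma}^T\left(\frac{1}{\mathsf{SNR}}\mathbf{I} + \mathbf{\Sigma}\mathbf{\Sigma}^T\right)^{-1}\mathbf{\Sigma}\right)\mathbf{V}^T\mathbf{a}_m && (59) \\ &= \mathbf{a}_m^T\mathbf{V}\mathbf{D}\mathbf{V}^T\mathbf{a}_m. && (60) \end{aligned}$$

Putting everything together, we have that

$$R(\mathbf{H}, \mathbf{a}_m) = -\frac{1}{2}\log \mathbf{a}_m^T\mathbf{V}\mathbf{D}\mathbf{V}^T\mathbf{a}_m. \tag{61}$$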
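
The closed form in Theorem 3 can be checked numerically against a direct evaluation of the effective noise $f(\mathbf{b}_m)$ at the optimal projection from Corollary 2. The following sketch assumes NumPy is available and uses the 2 x 2 example channel with an arbitrary SNR of 100; the function names are hypothetical.

```python
import numpy as np


def optimal_projection(H, A, snr):
    """B_OPT = A H^T (I/SNR + H H^T)^{-1}, as in Corollary 2."""
    n_rx = H.shape[0]
    return A @ H.T @ np.linalg.inv(np.eye(n_rx) / snr + H @ H.T)


def effective_noise(H, a, b, snr):
    """f(b) = ||b||^2 / SNR + ||H^T b - a||^2, as in (37)."""
    return b @ b / snr + np.sum((H.T @ b - a) ** 2)


def quadratic_form(H, a, snr):
    """a^T V D V^T a with V, D built from the eigendecomposition of H^T H.
    Eigenvalues of H^T H are the squared singular values of H; zero
    eigenvalues give D = 1, matching the rank-deficient case in (46)."""
    eigenvalues, V = np.linalg.eigh(H.T @ H)
    D = np.diag(1.0 / (snr * np.clip(eigenvalues, 0.0, None) + 1.0))
    return a @ V @ D @ V.T @ a


H = np.array([[2.0, 1.0], [1.0, 1.0]])
A = np.array([[2.0, 1.0], [1.0, 1.0]])
snr = 100.0

B_opt = optimal_projection(H, A, snr)
noise_direct = [effective_noise(H, A[m], B_opt[m], snr) for m in range(2)]
noise_closed = [quadratic_form(H, A[m], snr) for m in range(2)]
print(noise_direct, noise_closed)

# Remark 4: B_OPT approaches A H^dagger as SNR grows.
B_high_snr = optimal_projection(H, A, 1e9)
B_exact = A @ np.linalg.pinv(H)
```

For this channel the two evaluations agree, and at very high SNR the optimal projection is numerically indistinguishable from exact integer-forcing.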



C. Choosing Equations 

In the previous section, we explored choices of the preprocessing matrix B and characterized the optimal B for
a fixed full-rank integer matrix A. Now, we discuss how to select equation coefficients $a_1, \cdots, a_{2M}$ to maximize
the achievable rate in Theorem 2 or, equivalently, Theorem 3. In the integer-forcing linear receiver, we are free to
recover any full-rank set of linear equations with integer coefficients. However, due to the integer constraint on A,
it does not appear to be possible to give a closed-form solution for the best possible full-rank matrix A.

An initially tempting choice for A might be A = I. As we noted previously, for this choice of A, selecting
$B = H^{\dagger}$ reduces to the decorrelator while selecting $B = H^T \left( \frac{1}{\mathrm{SNR}} I + H H^T \right)^{-1}$ yields the linear MMSE estimator.
However, as we show, for most channel matrices, fixing A = I is suboptimal.

From Theorem 3, the achievable rate, optimized over the coefficient matrix $A = [a_1, \cdots, a_{2M}]^T$, is given by

$$R < \max_{A : |A| \neq 0} \min_m \left( -M \log a_m^T V D V^T a_m \right) . \tag{62}$$

In general, for a fixed SNR and channel matrix, finding the best coefficient matrix A appears to be a combinatorial
problem, requiring an explicit search over all possible full-rank integer matrices. The following lemma shows how
the search space can be somewhat reduced.

Lemma 1: To optimize the achievable rate in Theorem 3 (or, equivalently, in Theorem 2), it is sufficient to check
the space of all integer vectors $a_m$ with norm satisfying

$$\|a_m\|^2 \leq 1 + \lambda_{\max}^2 \mathrm{SNR} , \tag{63}$$

where $\lambda_{\max}$ is the maximum singular value of H.

Remark 5: This lemma thus shows that an exhaustive search only needs to check roughly $\mathrm{SNR}^{M}$ possibilities.

Proof: From (62), the achievable rate of the integer-forcing receiver is zero for all $a_m$ satisfying

$$a_m^T V D V^T a_m \geq 1 . \tag{64}$$

The left-hand side is lower bounded by

$$a_m^T V D V^T a_m = \| D^{1/2} V^T a_m \|^2 \tag{65}$$

$$= \sum_{i=1}^{2M} D_{i,i} |v_i^T a_m|^2 \tag{66}$$

$$\geq \min_i D_{i,i} \|a_m\|^2 \tag{67}$$

$$= \frac{\|a_m\|^2}{1 + \lambda_{\max}^2 \mathrm{SNR}} . \tag{68}$$

Hence, if $\|a_m\|^2 > 1 + \lambda_{\max}^2 \mathrm{SNR}$, then $a_m^T V D V^T a_m > 1$. ■
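The search suggested by Lemma 1 can be sketched as follows for a 2 × 2 real channel. This is an illustrative brute-force version under our own assumptions (function names are hypothetical, logarithms are base 2, and $V D V^T$ is computed as $(I + \mathrm{SNR}\, H^T H)^{-1}$). Candidates are sorted by effective noise; taking the best vector and then the first candidate that is linearly independent of it minimizes the larger of the two noise values, because every candidate ranked before the second pick is collinear with the first.

```python
import math
from itertools import product

def quadratic_form_matrix(H, snr):
    """Entries of Q = (I + SNR H^T H)^{-1} and the largest eigenvalue of H^T H."""
    g00 = H[0][0] ** 2 + H[1][0] ** 2
    g01 = H[0][0] * H[0][1] + H[1][0] * H[1][1]
    g11 = H[0][1] ** 2 + H[1][1] ** 2
    m00, m01, m11 = 1 + snr * g00, snr * g01, 1 + snr * g11
    det = m00 * m11 - m01 * m01
    lam_max_sq = (g00 + g11) / 2 + math.sqrt(((g00 - g11) / 2) ** 2 + g01 ** 2)
    return (m11 / det, -m01 / det, m00 / det), lam_max_sq

def effective_noise(q, a):
    """a^T Q a for the symmetric 2x2 matrix Q = (q00, q01, q11)."""
    q00, q01, q11 = q
    return q00 * a[0] ** 2 + 2 * q01 * a[0] * a[1] + q11 * a[1] ** 2

def search_equations(H, snr):
    """Return (a1, a2, rate) for the best full-rank integer pair."""
    q, lam_max_sq = quadratic_form_matrix(H, snr)
    bound = 1 + lam_max_sq * snr           # Lemma 1: ||a||^2 <= 1 + lambda_max^2 SNR
    r = math.isqrt(int(bound))
    candidates = sorted(
        (effective_noise(q, a), a)
        for a in product(range(-r, r + 1), repeat=2)
        if a != (0, 0) and a[0] ** 2 + a[1] ** 2 <= bound
    )
    noise1, a1 = candidates[0]
    for noise2, a2 in candidates[1:]:
        if a1[0] * a2[1] - a1[1] * a2[0] != 0:      # linear independence
            # Two real dimensions (M = 1): R = -M log(worst effective noise).
            return a1, a2, max(0.0, -math.log2(noise2))
    raise ValueError("no full-rank pair found")

H = [[0.7, 1.3], [0.8, 1.5]]    # example channel (assumed for illustration)
a1, a2, rate = search_equations(H, snr=1000.0)
print(a1, a2, rate)
```

The exhaustive loop is only practical for small dimensions and moderate SNR; the sign of each returned vector is arbitrary since $a$ and $-a$ give the same effective noise.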
Fig. 4. The decorrelator (left) fixes the equations to be $a_1 = [1\ 0]^T$ and $a_2 = [0\ 1]^T$. The integer-forcing linear receiver (right) allows
for any choice of linearly independent equations. Equations should be chosen in the direction of $v_{\max}$ to avoid noise amplification by $1/\lambda_{\min}^2$.

To conclude this subsection, we will now explicitly show how and why the choice A = I is indeed suboptimal
in general. In this context, it is instructive to use Lemma 1 to restate (62) as

$$R < \max_{\substack{A : |A| \neq 0 \\ \|a_m\|^2 \leq 1 + \lambda_{\max}^2 \mathrm{SNR}}} \min_m \left( -M \log a_m^T V D V^T a_m \right) . \tag{69}$$

Equation (69) suggests that we should choose coefficient vectors $a_1, \ldots, a_{2M}$ to be short and in the direction of
the right singular vector of H associated with its maximum singular value. To make this concrete, let us study a
particular 2 × 2 real MIMO channel for which the matrix H has singular values $\lambda_{\min}$ and $\lambda_{\max}$, with corresponding
right singular vectors $v_{\min}$ and $v_{\max}$, respectively, as illustrated in Figure 4. Here, decoder 1 recovers a linear
combination of the transmitted message vectors with integer coefficients $a_1 = [a_{1,1}\ a_{1,2}]^T$ and decoder 2 recovers a
linear combination with integer coefficients $a_2 = [a_{2,1}\ a_{2,2}]^T$. Using the exact integer-forcing rate from Corollary 1,
the following rate is achievable

$$R < \min_{m=1,2} \log \left( \frac{\mathrm{SNR}}{\sigma_m^2} \right) \tag{70}$$

where $\sigma_m^2$ can be interpreted as the effective noise variance for the $m$-th decoder,

$$\sigma_m^2 = \frac{1}{\lambda_{\min}^2} |v_{\min}^T a_m|^2 + \frac{1}{\lambda_{\max}^2} |v_{\max}^T a_m|^2 . \tag{71}$$

Since $1/\lambda_{\min}^2 \geq 1/\lambda_{\max}^2$, (71) suggests that $a_1, a_2$ should be chosen in the direction of $v_{\max}$, subject to the linear
independence constraint, to reduce the noise amplification by $1/\lambda_{\min}^2$. In the case of the decorrelator (or MMSE
receiver), the equation coefficients are fixed to be $a_1 = [1\ 0]^T$ and $a_2 = [0\ 1]^T$. As a result, the noise variance in at
least one of the streams will be heavily amplified by $1/\lambda_{\min}^2$ and the rate will be limited by the minimum singular
value of the channel matrix. With integer-forcing, we are free to choose any linearly independent $a_1, a_2$ since we
only require that our coefficient matrix A be invertible. By choosing $a_1, a_2$ in the direction of $v_{\max}$, we are protected
against large noise amplification in the case of near-singular channel matrices.

D. Complexity 

Our architecture has the same implementation complexity as that of a traditional linear receiver with the addition 
of the matrix search for A. The ideal joint ML receiver aggregates the time and space dimensions and finds the ML 
estimate across both. As a result, its complexity is exponential in the product of the blocklength and the number 
of data streams. Our architecture decouples the time and space dimensions by allowing for single-stream decoding. 
First, we search for the best integer matrix A, which has an exponential complexity in the number of data streams 
in the worst case. For slow fading channels, this search is only needed once per data frame. Afterwards, our receiver 
recovers M linearly independent equations of codewords according to A and then solves these for the original 
codewords. This step is polynomial in the number of data streams and exponential in the blocklength for an ML 
decoder. In practice, the decoding step can be considerably accelerated through the use of LDPC codes and the 
integer matrix search can be sped up via a sphere decoder. 
V. Fixed Channel Matrices

In this section, we compare the performance of the integer-forcing linear receiver against existing architectures 
through a series of examples. In Example 1, we compare the performance of different architectures for an ill- 
conditioned channel matrix and demonstrate that the choice of equation coefficients for the integer-forcing receiver 
changes with SNR. In Example 2, we compare the performance of the integer-forcing receiver with the decorrelator 
and show that the decorrelator can perform arbitrarily worse. In Example 3, we illustrate that the gap between the
integer-forcing receiver and lattice reduction can become unbounded. Finally, we show that the gap between the 
integer-forcing receiver and the joint decoder can be arbitrarily large in Example 4. 

A. Example 1 

Consider the 2 × 2 real MIMO channel with channel matrix

$$H = \begin{bmatrix} 0.7 & 1.3 \\ 0.8 & 1.5 \end{bmatrix} . \tag{72}$$

Figure 5 shows the performance of the different architectures. (Recall that we assume equal-rate data streams on
both transmit antennas, as in Definition 3.) The achievable rates for the traditional linear receivers are given by (12)
and that of the joint receiver is given by (8). The decorrelator and the MMSE receiver aim to separate the data
streams and cancel the interference from other streams. However, this is difficult since the columns of the channel
matrix are far from orthogonal. The integer-forcing architecture attempts to exploit the interference by decoding two
linearly independent equations in the direction of the maximum eigenvector $v_{\max} = [0.47\ 0.88]^T$. For example, at
SNR = 30 dB, we choose equation coefficients $a_1 = [1\ 2]^T$ and $a_2 = [6\ 11]^T$, while for SNR = 40 dB, we choose
equation coefficients $a_1 = [1\ 2]^T$ and $a_2 = [7\ 13]^T$. Thus, for different values of SNR, the optimal equation
coefficients generally change.

Fig. 5. Achievable rates for the 2 × 2 real-valued MIMO channel with fixed channel matrix from (72). (Horizontal axis: SNR in dB.)



B. Example 2: Integer-forcing vs. decorrelator 

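Consider the 2 × 2 real MIMO channel with channel matrix:

$$H = \begin{bmatrix} 1 & 1 + \sqrt{\epsilon} \\ 0 & \epsilon \end{bmatrix} \tag{73}$$

where we assume $0 < \epsilon \ll 1$, $1/\sqrt{\epsilon}$ is an integer and $\mathrm{SNR} \gg 1$. We first note that

$$H^{-1} = \frac{1}{\epsilon} \begin{bmatrix} \epsilon & -(1 + \sqrt{\epsilon}) \\ 0 & 1 \end{bmatrix} .$$

Using (12) with $B = H^{-1}$, the achievable rate of the decorrelator is

$$R_{\mathrm{DECORR}} = 2 \min \left\{ \frac{1}{2} \log \left( 1 + \frac{\epsilon^2 \mathrm{SNR}}{\epsilon^2 + (1 + \sqrt{\epsilon})^2} \right) , \ \frac{1}{2} \log \left( 1 + \epsilon^2 \mathrm{SNR} \right) \right\} \tag{74}$$

$$= \log \left( 1 + \frac{\epsilon^2 \mathrm{SNR}}{\epsilon^2 + (1 + \sqrt{\epsilon})^2} \right) \tag{75}$$

$$\leq \log \left( 1 + \epsilon^2 \mathrm{SNR} \right) . \tag{76}$$

We compare the achievable rate of the decorrelator with the exact integer-forcing rate from Corollary 1. The equation
coefficients selected by the decoders are

$$a_1 = [1\ \ 1]^T \tag{77}$$

$$a_2 = \left[ \tfrac{1}{\sqrt{\epsilon}} \ \ \tfrac{1}{\sqrt{\epsilon}} + 1 \right]^T . \tag{78}$$

Using Corollary 1, the achievable rate of exact integer-forcing with equation coefficients $A = [a_1, a_2]^T$ is

$$R_{\mathrm{INTEGER}} = 2 \min_{m=1,2} \frac{1}{2} \log \left( \frac{\mathrm{SNR}}{\| (H^{-1})^T a_m \|^2} \right) \tag{79}$$

$$= 2 \min \left\{ \frac{1}{2} \log \left( \frac{\mathrm{SNR}}{1 + \frac{1}{\epsilon}} \right) , \ \frac{1}{2} \log \left( \frac{\mathrm{SNR}}{\frac{1}{\epsilon}} \right) \right\} \tag{80}$$

$$= \log \left( \frac{\mathrm{SNR}}{1 + \frac{1}{\epsilon}} \right) \tag{81}$$

$$\geq \log \left( \frac{\mathrm{SNR}}{\frac{2}{\epsilon}} \right) \tag{82}$$

$$= \log \left( \frac{\epsilon\, \mathrm{SNR}}{2} \right) \tag{83}$$

where the inequality follows since $0 < \epsilon \ll 1$.

We compare the two linear architectures to the joint ML decoder whose achievable rate is given by (8). For
$0 < \epsilon \ll 1$ and $\mathrm{SNR} \gg 1$, the rate of the joint decoder is

$$R_{\mathrm{JOINT}} = \frac{1}{2} \log \det \left( I + H H^T \mathrm{SNR} \right) \tag{84}$$

$$= \frac{1}{2} \log \left( (1 + \mathrm{SNR})(1 + \epsilon^2 \mathrm{SNR}) + (1 + \sqrt{\epsilon})^2 \mathrm{SNR} \right) . \tag{85}$$

Finally, let us compare the three rates in the setting where $\mathrm{SNR} \to \infty$, and where the parameter $\epsilon$ in our channel
model tends to zero according to $\epsilon \sim 1/\sqrt{\mathrm{SNR}}$. In that special case, we can observe that*

$$R_{\mathrm{DECORR}} \sim 1 \tag{86}$$

$$R_{\mathrm{INTEGER}} \sim \frac{1}{2} \log (\mathrm{SNR}) \tag{87}$$

$$R_{\mathrm{JOINT}} \sim \frac{1}{2} \log (\mathrm{SNR}) \tag{88}$$

Hence, the loss from using the decorrelator instead of the integer-forcing receiver becomes unbounded in this regime
as $\mathrm{SNR} \to \infty$. Furthermore, the integer-forcing receiver achieves the same scaling as the joint decoder.

* Recall that $f(\mathrm{SNR}) \sim g(\mathrm{SNR})$ implies that $\lim_{\mathrm{SNR} \to \infty} \frac{f(\mathrm{SNR})}{g(\mathrm{SNR})} = 1$.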
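The three rates in this example can be compared numerically. The sketch below is illustrative only: it takes the channel to be $H = [[1, 1 + \sqrt{\epsilon}], [0, \epsilon]]$ (our reading of (73)), uses base-2 logarithms, sets $\epsilon = 1/\sqrt{\mathrm{SNR}}$ with $1/\sqrt{\epsilon} = k$ an integer, and uses the integer coefficients $a_1 = [1\ 1]^T$ and $a_2 = [k\ \ k + 1]^T$. The function names are hypothetical.

```python
import math

def inv2(m):
    """Inverse of a 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def decorrelator_rate(H, snr):
    """2 * min_m 0.5 * log2(1 + SNR / ||b_m||^2) with B = H^{-1}."""
    return min(math.log2(1 + snr / (row[0] ** 2 + row[1] ** 2)) for row in inv2(H))

def exact_if_rate(H, A, snr):
    """2 * min_m 0.5 * log2(SNR / ||(H^{-1})^T a_m||^2), floored at zero."""
    H_inv = inv2(H)
    rates = []
    for a in A:
        b = [a[0] * H_inv[0][j] + a[1] * H_inv[1][j] for j in range(2)]  # b^T = a^T H^{-1}
        rates.append(math.log2(snr / (b[0] ** 2 + b[1] ** 2)))
    return max(0.0, min(rates))

def joint_rate(H, snr):
    """0.5 * log2 det(I + SNR H H^T)."""
    g00 = H[0][0] ** 2 + H[0][1] ** 2
    g01 = H[0][0] * H[1][0] + H[0][1] * H[1][1]
    g11 = H[1][0] ** 2 + H[1][1] ** 2
    det = (1 + snr * g00) * (1 + snr * g11) - (snr * g01) ** 2
    return 0.5 * math.log2(det)

k = 10                      # 1 / sqrt(eps), assumed integer
eps = 1 / k ** 2
snr = 1 / eps ** 2          # so that eps = 1 / sqrt(SNR)
H = [[1.0, 1.0 + math.sqrt(eps)], [0.0, eps]]
A = [(1, 1), (k, k + 1)]

r_dec = decorrelator_rate(H, snr)
r_int = exact_if_rate(H, A, snr)
r_joint = joint_rate(H, snr)
print(r_dec, r_int, r_joint)
```

Increasing `k` should keep the decorrelator rate below one bit while the other two rates grow together.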
C. Example 3: Integer-forcing vs. lattice-reduction

In this example, we illustrate the difference between the proposed integer-forcing architecture and the
lattice-reduction receiver. First, we note that, unlike the integer-forcing receiver, the lattice-reduction receiver is not
required to use a lattice code but it should use a constellation with regular spacing, such as PAM or QAM. However,
the key difference is that the effective channel matrix for lattice-reduction receivers is restricted to be unimodular*
while the effective channel for integer-forcing receivers can be any full-rank integer matrix. In this example, we show
that this restriction can result in an arbitrarily large performance penalty. We consider the M × M MIMO channel
with channel matrix:

$$H = \begin{bmatrix} 1 & 0 & \cdots & 0 & 0 \\ 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & 0 \\ -1 & -1 & \cdots & -1 & 2 \end{bmatrix} . \tag{89}$$

A simple calculation shows that the inverse of this channel matrix is given by

$$H^{-1} = \begin{bmatrix} 1 & 0 & \cdots & 0 & 0 \\ 0 & 1 & \cdots & 0 & 0 \\ \vdots & \vdots & \ddots & \vdots & \vdots \\ 0 & 0 & \cdots & 1 & 0 \\ \frac{1}{2} & \frac{1}{2} & \cdots & \frac{1}{2} & \frac{1}{2} \end{bmatrix} . \tag{90}$$

Since $H^{-1}$ has non-integer entries, H is not unimodular. The coefficient matrix that maximizes the achievable rate
for the exact integer-forcing receiver from Corollary 1 is

$$A_{\mathrm{INTEGER}} = H ,$$

leading to an effective noise variance in each stream that satisfies

$$\sigma_{\mathrm{INTEGER}}^2 = 1 . \tag{91}$$

By contrast, following the lattice-reduction receiver, we must ensure that the resulting effective channel matrix is
unimodular. Using the fact that $(H^T)^{-1}$ is a basis for the body-centered cubic (BCC) lattice, it can be shown that
the best choice of matrix is

$$A_{\mathrm{UNIMODULAR}} = I . \tag{92}$$

It follows that the effective noise variance in the worst stream is given by

$$\sigma_{\mathrm{UNIMODULAR}}^2 = \max_m \| (H^T)^{-1} a_{\mathrm{UNIMODULAR},m} \|^2 \tag{93}$$

$$= \max \{ M/4, 1 \} . \tag{94}$$

Hence, as the number of antennas becomes large ($M \to \infty$), restricting the effective matrix to be unimodular
results in an arbitrarily large loss.
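The noise variances in this example are easy to verify directly. The sketch below is a minimal illustration under our reading of the channel matrix (identity in the first M − 1 rows, last row equal to [−1, …, −1, 2]); the function names are our own. It checks the stated inverse and evaluates the worst-stream noise variance of exact integer-forcing, $B = A H^{-1}$, for $A = I$ and $A = H$.

```python
def identity(M):
    return [[1.0 if i == j else 0.0 for j in range(M)] for i in range(M)]

def example_matrices(M):
    """Channel matrix of this example (as read here) and its claimed inverse."""
    H = identity(M)[: M - 1] + [[-1.0] * (M - 1) + [2.0]]
    H_inv = identity(M)[: M - 1] + [[0.5] * M]
    return H, H_inv

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]

def worst_noise_variance(A, H_inv):
    """max_m ||b_m||^2 for exact integer-forcing with B = A H^{-1}."""
    B = matmul(A, H_inv)
    return max(sum(x * x for x in row) for row in B)

M = 8
H, H_inv = example_matrices(M)
print(worst_noise_variance(identity(M), H_inv))  # unimodular choice A = I
print(worst_noise_variance(H, H_inv))            # integer-forcing choice A = H
```

For A = I the worst stream is the last row of $H^{-1}$, whose squared norm grows as M/4, while A = H gives B = I and unit noise variance in every stream.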



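* Recall that a matrix is unimodular if it has integer entries and its inverse has integer entries.

D. Example 4: Integer-forcing vs. joint ML decoder

Finally, we illustrate the point that the integer-forcing receiver can sometimes be arbitrarily worse than optimal
joint decoding. To see this, we consider a 2 × 2 MIMO channel with the channel matrix:

$$H = \begin{bmatrix} 1 & 1 \\ 0 & \epsilon \end{bmatrix} \tag{95}$$

where $0 < \epsilon \ll 1$. The rate attainable via joint ML decoding is

$$R_{\mathrm{JOINT}} = \frac{1}{2} \log \left( (1 + 2\,\mathrm{SNR})(1 + \epsilon^2 \mathrm{SNR}) - \epsilon^2 \mathrm{SNR}^2 \right) \tag{96}$$

$$\geq \frac{1}{2} \log (1 + 2\,\mathrm{SNR}) . \tag{97}$$

We note that the inverse of the channel matrix is given by

$$H^{-1} = \begin{bmatrix} 1 & -\frac{1}{\epsilon} \\ 0 & \frac{1}{\epsilon} \end{bmatrix} . \tag{98}$$

We bound the achievable rate of exact integer-forcing from Corollary 1 as follows

$$R_{\mathrm{INTEGER}} = 2 \max_{A : |A| \neq 0} \min_{m=1,2} \frac{1}{2} \log \left( \frac{\mathrm{SNR}}{\| (H^{-1})^T a_m \|^2} \right) \tag{99}$$

$$= 2 \max_{A : |A| \neq 0} \min_{m=1,2} \frac{1}{2} \log \left( \frac{\mathrm{SNR}}{a_{m,1}^2 + (a_{m,2} - a_{m,1})^2 \frac{1}{\epsilon^2}} \right) \tag{100}$$

$$\leq \max_{a_{m,2} \neq a_{m,1}} \log \left( \frac{\mathrm{SNR}}{a_{m,1}^2 + (a_{m,2} - a_{m,1})^2 \frac{1}{\epsilon^2}} \right) \tag{101}$$

$$\leq \log (\epsilon^2 \mathrm{SNR}) . \tag{102}$$

Here (101) holds because a full-rank A must contain at least one row with $a_{m,2} \neq a_{m,1}$, and (102) holds because
$(a_{m,2} - a_{m,1})^2 \geq 1$ for such an integer row.

Let $\epsilon \sim 1/\sqrt{\mathrm{SNR}}$ and consider the regime $\mathrm{SNR} \to \infty$. The gap between the optimal joint receiver and the
integer-forcing linear receiver can be arbitrarily large. However, as we will see in Section VI, the average behavior of
the integer-forcing linear receiver is close to the joint decoder under a Rayleigh fading distribution for medium to high
SNR.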

VI. Performance for Slow Fading Channels 

A. Model and Definitions 

We now demonstrate that the integer-forcing receiver nearly matches the performance of the joint ML decoder under
a slow fading channel model. Since the integer-forcing receiver can mimic the operation performed by a
zero-forcing or MMSE receiver (as well as decode messages via equations), it is not surprising that it offers higher
rates. However, these architectures are often coupled with some form of SIC. We will show that the integer-forcing
receiver outperforms the following standard SIC architectures:

• V-BLAST I: The receiver decodes and cancels the data streams in a predetermined order, irrespective of the
channel realization. Each data stream has the same rate. See (14) for the rate expression.

• V-BLAST II: The receiver selects the decoding order separately for each channel realization in such a way as
to maximize the effective SNR for the data stream that sees the worst channel. Each data stream has the same
rate. See (15) for the rate expression.

• V-BLAST III: The receiver decodes and cancels the data streams in a predetermined order. The rate of each
data stream is selected to maximize the sum rate. The rate expression is given in Section VI-B.

In Sections VI-C and VI-D, we compare these schemes through simulations as well as their diversity-multiplexing
tradeoffs. For completeness, we also compare integer-forcing to an SIC architecture that allows for both variable
decoding order and unequal rate allocation in Appendix A.

We adopt the standard quasi-static Rayleigh fading model where each element of the complex channel matrix
is independent and identically distributed according to a circularly symmetric complex normal distribution of
unit variance. The transmitter is only aware of the channel statistics while the receiver knows the exact channel
realization. As a result, we will have to cope with some outage probability $p_{\mathrm{OUT}}$.

Definition 7: Assume there exists an architecture that encodes each data stream at the same rate and achieves
sum rate $R(H)$. For a target rate R, the outage probability is defined as

$$p_{\mathrm{OUT}}(R) = \Pr (R(H) < R) . \tag{103}$$

For a fixed probability $p \in (0, 1]$, we define the outage rate to be

$$R_{\mathrm{OUT}}(p) = \sup \{ R : p_{\mathrm{OUT}}(R) \leq p \} . \tag{104}$$
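Definition 7 translates directly into empirical estimators. The following sketch is illustrative only: the function names are ours, and the sampled rate $\log_2 \det(I + \mathrm{SNR}\, H H^H)$ for a 2 × 2 complex Rayleigh channel is used merely as a stand-in for $R(H)$; any architecture's rate function could be substituted. Given sampled rates, the outage probability is the fraction of samples below the target, and the outage rate is the order statistic at index $\lfloor p n \rfloor$ of the sorted samples.

```python
import math
import random

def outage_probability(rates, target):
    """Empirical Pr(R(H) < target)."""
    return sum(r < target for r in rates) / len(rates)

def outage_rate(rates, p):
    """Empirical sup{R : p_out(R) <= p}."""
    ordered = sorted(rates)
    k = math.floor(p * len(ordered))
    return math.inf if k >= len(ordered) else ordered[k]

def sample_joint_rate(snr, rng):
    """log2 det(I + SNR H H^H) for one 2x2 complex Rayleigh draw (stand-in for R(H))."""
    s = math.sqrt(0.5)   # unit-variance circularly symmetric entries
    h = [[complex(rng.gauss(0, s), rng.gauss(0, s)) for _ in range(2)]
         for _ in range(2)]
    g00 = sum(abs(x) ** 2 for x in h[0])
    g11 = sum(abs(x) ** 2 for x in h[1])
    g01 = h[0][0] * h[1][0].conjugate() + h[0][1] * h[1][1].conjugate()
    det = (1 + snr * g00) * (1 + snr * g11) - snr ** 2 * abs(g01) ** 2
    return math.log2(det)

rng = random.Random(0)
rates = [sample_joint_rate(100.0, rng) for _ in range(2000)]
print(outage_rate(rates, 0.01), outage_rate(rates, 0.05))
```

With finitely many samples these are only estimates; small outage probabilities need correspondingly many channel draws.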
B. Rate Allocation

We have assumed that each data stream is encoded at the same rate. This is optimal for linear receivers under
isotropic fading. However, when SIC is used, rate allocation can be beneficial in an outage scenario. To compare the
performance of integer-forcing to SIC with rate allocation, we consider V-BLAST III in this section. V-BLAST III
performs SIC with a fixed decoding order and allows for rate allocation among the different data streams. Without
loss of generality for Rayleigh fading, if we fix a decoding order, we may take it to be $\pi = (1, 2, \ldots, 2M)$. From
(13), the achievable rate for stream m is

(105)

Since the streams are decoded in order, the later streams will achieve higher rates on average than the earlier
streams. V-BLAST III allocates lower rates to earlier streams and higher rates to later streams. We now generalize
Definition 7 to include rate allocation.

Definition 8: Assume an architecture that achieves rate $R_m(H)$ in stream m. For a target rate R, the outage
probability is given by:

(106)

For a fixed probability $p \in (0, 1]$, we define the outage rate to be:

$$R_{\mathrm{OUT}}(p) = \sup \{ R : p_{\mathrm{OUT}}(R) \leq p \} . \tag{107}$$

C. Outage Behavior

We now compare the outage rate and probabilities for the receiver architectures discussed above. It is easy to see
that the zero-forcing receiver performs strictly worse than the MMSE receiver and V-BLAST I performs strictly
worse than V-BLAST II. We have chosen to omit these two architectures from our plots to avoid overcrowding.
Figure 6 shows the 1 percent outage rate and Figure 7 shows the 5 percent outage rate. In both cases, the
integer-forcing receiver nearly matches the rate of the joint ML receiver while the MMSE receiver achieves significantly
lower performance. The SIC architectures with either an optimal decoding order, V-BLAST II, or an optimized
rate allocation, V-BLAST III, improve the performance of the MMSE receiver significantly but still achieve lower
rates than the integer-forcing receiver from medium SNR onwards. Our simulations suggest that the outage rate of
the integer-forcing receiver remains within a small gap from the outage rate of the joint ML receiver. However, we
recall from the example given in Subsection V-D that it is not true that the integer-forcing receiver is uniformly
near-optimal for all fading realizations. Figure 8 shows the outage probability for the target sum rate R = 6. We note that
the integer-forcing receiver achieves the same slope as the joint decoder. In the next subsection, we characterize
the diversity-multiplexing tradeoff of the integer-forcing receiver and compare it with the diversity-multiplexing
tradeoff of traditional architectures that are considered in [25]. We show that the integer-forcing receiver attains the
optimal diversity-multiplexing tradeoff in the case where each transmit antenna sends an independent data stream.

D. Diversity-Multiplexing Tradeoff

The diversity-multiplexing tradeoff (DMT) provides a rough characterization of the performance of a MIMO
transmission scheme at high SNR [25].

Definition 9: A family of codes is said to achieve spatial multiplexing gain r and diversity gain d if the total
data rate and the average probability of error satisfy

$$\lim_{\mathrm{SNR} \to \infty} \frac{R(\mathrm{SNR})}{\log \mathrm{SNR}} \geq r \tag{108}$$

$$\lim_{\mathrm{SNR} \to \infty} \frac{\log P_e(\mathrm{SNR})}{\log \mathrm{SNR}} \leq -d . \tag{109}$$
Fig. 7. 5 percent outage rates for the 2 × 2 complex-valued MIMO channel with Rayleigh fading. (Horizontal axis: SNR in dB.)

Fig. 8. Outage probability for the 2 × 2 complex-valued MIMO channel with Rayleigh fading for a target sum rate of R = 6.

In the case where each transmit antenna encodes an independent data stream,* the optimal DMT is

$$d_{\mathrm{JOINT}}(r) = N \left( 1 - \frac{r}{M} \right) \tag{110}$$

where $r \in [0, M]$ and can be achieved by joint ML decoding [25]. The DMTs achieved by the decorrelator and
SIC architectures are as follows [25]:

$$d_{\mathrm{DECORR}}(r) = \left( 1 - \frac{r}{M} \right) \tag{111}$$

$$d_{\text{V-BLAST I}}(r) = \left( 1 - \frac{r}{M} \right) \tag{112}$$

$$d_{\text{V-BLAST II}}(r) \leq (N - 1) \left( 1 - \frac{r}{M} \right) \tag{113}$$

$$d_{\text{V-BLAST III}}(r) = \text{piecewise linear curve connecting points } (r_k, n - k) \tag{114}$$

$$r_0 = 0 , \qquad r_k = \sum_{i=0}^{k-1} \frac{k - i}{n - i} , \quad 1 \leq k \leq n .$$

* If joint encoding across the antennas is permitted, then a better DMT is achievable. See [25] for more details.

The decorrelator chooses the matrix B to cancel the interference from the other data streams. As a result, the noise 
is heavily amplified when the channel matrix is near singular and the performance is limited by the minimum 
singular value of the channel matrix. In the integer-forcing linear receiver, the effective channel matrix A is not 
limited to be the identity matrix but can be any full-rank integer matrix. This additional freedom is sufficient to 
recover the same DMT as the joint ML decoder. 

Theorem 4: For a MIMO channel with M transmit antennas, $N \geq M$ receive antennas, and Rayleigh fading, the
achievable diversity-multiplexing tradeoff for the integer-forcing receiver is given by

$$d_{\mathrm{INTEGER}}(r) = N \left( 1 - \frac{r}{M} \right) \tag{115}$$

where $r \in [0, M]$.

The proof of Theorem 4 is given in Appendix B. Figure 9 illustrates the DMT for a 4 × 4 MIMO channel. The
integer-forcing receiver achieves a maximum diversity of 4 while the decorrelator and V-BLAST I can
only achieve a diversity of 1 since their performance is limited by the worst data stream. V-BLAST II achieves a
higher DMT than V-BLAST I but a lower diversity than the integer-forcing receiver since its rate is still limited
by the worst stream after the optimal decoding order is applied. V-BLAST III achieves the optimal diversity at the
point r = 0 since only one data stream is used in transmission. For values of r > 0, the achievable diversity is
suboptimal.
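The DMT curves discussed above can be tabulated with a few lines of code. This is a sketch under our reading of the expressions in this subsection: $N(1 - r/M)$ for joint decoding and integer-forcing, $(1 - r/M)$ for the decorrelator, and for V-BLAST III the piecewise linear curve through $(r_k, n - k)$ with $r_0 = 0$ and $r_k = \sum_{i=0}^{k-1} (k - i)/(n - i)$. Function names are hypothetical, and exact fractions are used to avoid rounding.

```python
from fractions import Fraction

def dmt_joint(r, M, N):
    """N (1 - r/M): joint ML decoding and the integer-forcing receiver."""
    return N * (1 - Fraction(r) / M)

def dmt_decorrelator(r, M):
    """(1 - r/M): decorrelator and V-BLAST I."""
    return 1 - Fraction(r) / M

def vblast3_corner_points(n):
    """Corner points (r_k, n - k), k = 0..n, of the V-BLAST III curve."""
    points = [(Fraction(0), n)]
    for k in range(1, n + 1):
        r_k = sum(Fraction(k - i, n - i) for i in range(k))
        points.append((r_k, n - k))
    return points

def dmt_vblast3(r, n):
    """Linear interpolation between the corner points."""
    r = Fraction(r)
    pts = vblast3_corner_points(n)
    for (r0, d0), (r1, d1) in zip(pts, pts[1:]):
        if r0 <= r <= r1:
            return d0 + (d1 - d0) * (r - r0) / (r1 - r0)
    raise ValueError("r must lie in [0, n]")

for r in range(5):
    print(r, dmt_joint(r, 4, 4), dmt_decorrelator(r, 4), dmt_vblast3(r, 4))
```

For the 4 × 4 case, all curves meet at zero diversity for r = 4, and V-BLAST III matches the optimal curve only at r = 0.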




Fig. 9. Diversity-multiplexing tradeoff for the 4 × 4 MIMO channel with Rayleigh fading. (Horizontal axis: multiplexing gain r.)



E. Discussion 

As noted earlier, receiver architectures based on zero-forcing face a rate penalty when the channel matrix is
ill-conditioned. Integer-forcing circumvents this issue by allowing the receiver to first decode equations whose
coefficients are matched with those of the channel. From one perspective, the resulting gains are of a similar nature
as those obtained by lattice-reduction receivers. One important difference is that integer-forcing applies a modulo
operation at the receiver prior to decoding, which retains the linear structure of the codebook. This allows us to
derive closed-form rate expressions analogous to those for traditional linear receivers. Typically, lattice reduction is
used at the symbol level followed by a decoding step [26]. While this form of lattice reduction can be used to
obtain the full receive diversity [27], it does not seem to suffice in terms of rate.

Another key advantage of integer-forcing is that it completely decouples the spatial aspect of decoding from the
temporal aspect. That is, the search for the best integer matrix A to approximate the channel matrix H is completed
before we attempt to decode the integer combinations of codewords. Thus, apart from the search* for the best A,
which in a slow-fading environment does not have to be executed frequently, the complexity of the integer-forcing
receiver is similar to that of the zero-forcing receiver.

From our outage plots, it is clear that the integer-forcing receiver significantly outperforms the basic MMSE
receiver. Moreover, integer-forcing beats more sophisticated SIC-based V-BLAST architectures, even when these
are permitted to optimize their rate allocation while integer-forcing is not. We note that it is possible to develop
integer-forcing schemes that permit unequal rate allocations as well as a form of interference cancellation [31]
but this is beyond the scope of the present paper.

Integer-forcing also attains the full diversity-multiplexing tradeoff, unlike the V-BLAST architectures discussed
above. Earlier work developed lattice-based schemes that attain the full DMT [28], [29] but, to the best of our
knowledge, ours is the first that decouples spatial decoding from temporal decoding. The caveat is that the DMT
result presented in the current paper only applies if there is no spatial coding across transmit antennas, whereas
the DMT results of [28], [29] apply in general. Characterizing the DMT of integer-forcing when there is coding
across transmit antennas is an interesting subject for future study.

* This search can be considerably sped up in practice through the use of a sphere decoding algorithm.
VII. Oblivious Interference Mitigation
A. MIMO Channel with Interference

We have studied the performance of the integer-forcing linear receiver under the standard MIMO channel and
found that it achieves outage rates close to those of the joint ML decoder as well as the same DMT. In this section, we
show that integer-forcing architectures are also successful at dealing with a different kind of channel disturbance,
namely interference. We assume that the interfering signal is low-dimensional (compared to the number of receive
antennas), and we are most interested in the case where the variance of this interfering signal increases (at a certain
rate) with the transmit power. We show that the integer-forcing architecture can be used to perform "oblivious"
interference mitigation. By oblivious, we mean that the transmitter and receiver are unaware of the codebook of
the interferer (if there is one). However, the receiver knows which subspace is occupied by the interference. By
selecting equation coefficients in a direction that depends both on the interference space and on the channel matrix,
the integer-forcing receiver reduces the impact of interference and attains a significant gain over traditional linear
receivers. We will characterize the generalized degrees-of-freedom and show that it matches that of the joint ML decoder.

Remark 6: Oblivious receivers have been thoroughly studied in the context of cellular systems [32] and distributed
MIMO [33].

For ease of notation and tractability, the discussion presented in this section is limited to channels whose channel
matrix is square, i.e., with equal number of transmit and receive antennas. Recall that the real-valued representation
of the M × M complex-valued MIMO channel (see Definition 4) is given by

$$Y = H X + Z \tag{116}$$

where $Y \in \mathbb{R}^{2M \times n}$ is the channel output, $H \in \mathbb{R}^{2M \times 2M}$ is the real-valued representation of the fading matrix,
$X \in \mathbb{R}^{2M \times n}$ is the channel input, and the noise $Z \in \mathbb{R}^{2M \times n}$ has i.i.d. Gaussian entries with unit variance. In this
section, we extend the standard MIMO channel to include the case of interference. The generalized model has
channel output

$$Y = H X + J V + Z \tag{117}$$

where H is the channel matrix, X is the channel input, and Z is the noise, all as in the previous model. An external
interferer adds $V \in \mathbb{R}^{2K \times n}$ in the direction represented by the column space of $J \in \mathbb{R}^{2M \times 2K}$. We assume that
each element of V is i.i.d. Gaussian with variance INR. We assume that J is fixed during the whole transmission
block and known only to the receiver.

The definitions for messages, rates, encoders, and decoders follow along similar lines as those for the standard
MIMO channel (see Definitions 1, 2, and 3 in Section II).



B. Traditional Linear Receivers 

As in the case without interference, traditional linear receivers process the channel output Y by multiplying it
by a 2M × 2M matrix B to arrive at the effective output

$$\tilde{Y} = B Y \tag{118}$$

and recover the message $w_m$ using only the $m$-th row of the matrix $\tilde{Y}$. By analogy to (12), the achievable sum rate
can be expressed as

$$R_{\mathrm{LINEAR}}(H, J, B) = \min_m 2M\, R_m(H, J, B) , \tag{119}$$

where $R_m(H, J, B)$ represents the achievable rate for the $m$-th data stream (using Gaussian codebooks),

$$R_m(H, J, B) = \frac{1}{2} \log \left( 1 + \frac{\mathrm{SNR}\, \|b_m^T h_m\|^2}{\|b_m\|^2 + \mathrm{INR}\, \|J^T b_m\|^2 + \mathrm{SNR} \sum_{i \neq m} \|b_m^T h_i\|^2} \right) .$$

Again, let us discuss several choices of the matrix B. The decorrelator, given by $B = H^{-1}$, removes the
interference due to other data streams but does not cancel the external interference J (except in the very special
case where the subspace spanned by J is orthogonal to the subspace spanned by $H^{-1}$). Alternatively, if we choose
$B = J^{\perp}$, where $J^{\perp}$ is the 2K × 2M matrix whose rowspace is orthogonal to the columnspace of J, then the
external interference term is indeed nulled. The resulting output $J^{\perp} Y$ can then be processed by a traditional linear
receiver. This scheme achieves good performance in high INR regimes but does not perform well in high SNR
regimes since the interference due to the other data streams is mostly unresolved. The MMSE receiver improves
the performance of both architectures by choosing $B = H^T \left( \frac{1}{\mathrm{SNR}} I + \frac{\mathrm{INR}}{\mathrm{SNR}} J J^T + H H^T \right)^{-1}$. However, since there
are 2M data streams and the interference is of dimension 2K, it is impossible to cancel both the interference from
other data streams and the external interference with any matrix B. One way out of this conundrum is to reduce
the number of transmitted streams to 2M − 2K. For this scenario, the MMSE receiver can be applied to mitigate
both the external interference and the interference from other data streams.

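Complexity permitting, we can again improve performance by resorting to successive interference cancellation
architectures. The achievable rate for V-BLAST I in the standard MIMO channel from (14) becomes

$$R_{\mathrm{SIC},1}(H) = \min_m 2M\, R_{\pi(m)}(H) \tag{120}$$

where

$$R_{\pi(m)}(H) = \frac{1}{2} \log \left( 1 + \frac{\mathrm{SNR}\, \|b_m^T h_{\pi(m)}\|^2}{\|b_m\|^2 + \mathrm{INR}\, \|J^T b_m\|^2 + \mathrm{SNR} \sum_{i > m} \|b_m^T h_{\pi(i)}\|^2} \right) \tag{121}$$

and $b_m$ is the projection vector used for stream $\pi(m)$. The achievable rate for V-BLAST II follows by maximizing
the rate in (120) over all decoding orders $\pi \in \Pi$.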



C. Integer-Forcing Linear Receiver 

We apply the integer-forcing linear receiver proposed in Section IV to the problem of mitigating interference
(see Figure 10). The channel output matrix Y is first multiplied by a fixed matrix B to form the matrix $\tilde{Y}$ whose
$m$-th row $\tilde{y}_m^T$ is the signal fed into the $m$-th decoder. Each such row can be expressed as

$$\tilde{y}_m^T = \sum_{i=1}^{2M} (b_m^T h_i)\, x_i^T + b_m^T J V + b_m^T Z \tag{122}$$

$$= \sum_{i=1}^{2M} \tilde{h}_{m,i}\, x_i^T + \tilde{v}_m^T + \tilde{z}_m^T \tag{123}$$

where $\tilde{h}_m = H^T b_m$ is the effective channel to the $m$-th decoder, $\tilde{v}_m^T$ is the effective interference with variance
$\|J^T b_m\|^2\, \mathrm{INR}$, and $\tilde{z}_m^T$ is the effective noise with variance $\|b_m\|^2$. The next theorem and its following remarks
generalize Theorem 2, Corollary 1, and Corollary 2 to include the case with interference.

Fig. 10. Integer-forcing linear receiver for the complex-valued M × M MIMO channel with external interference of dimension K.

Theorem 5: Consider the MIMO channel with channel matrix $H \in \mathbb{R}^{2M \times 2M}$ and interference matrix
$J \in \mathbb{R}^{2M \times 2K}$. For any full-rank integer matrix $A \in \mathbb{Z}^{2M \times 2M}$ and any 2M × 2M matrix $B = [b_1 \cdots b_{2M}]^T$, the
following sum rate is achievable using the integer-forcing linear receiver:

$$R(H, J, A, B) < \min_m 2M\, R(H, J, a_m, b_m) \tag{124}$$

where $R(H, J, a_m, b_m)$ is given by

$$R(H, J, a_m, b_m) = \frac{1}{2} \log \left( \frac{\mathrm{SNR}}{\|b_m\|^2 + \|J^T b_m\|^2\, \mathrm{INR} + \|H^T b_m - a_m\|^2\, \mathrm{SNR}} \right) .$$

Fig. 11. The decorrelator (left) fixes the equations to be $a_1 = [1\ 0]^T$ and $a_2 = [0\ 1]^T$. The integer-forcing linear receiver (right) allows
for any choice of linearly independent equations. Equations should be chosen in the direction orthogonal to $\tilde{J} = H^{-1} J$.

Remark 7: Exact integer-forcing selects $B = A H^{-1}$. The achievable rate can be expressed more concisely as

$$R < \min_m M \log \left( \frac{\mathrm{SNR}}{\| (H^{-1})^T a_m \|^2 + \| J^T (H^{-1})^T a_m \|^2\, \mathrm{INR}} \right) . \tag{125}$$

Remark 8: The optimal projection matrix that maximizes the achievable rate in Theorem 5 is given by

$$B_{\mathrm{OPT}} = A H^T \left( H H^T + J J^T \frac{\mathrm{INR}}{\mathrm{SNR}} + \frac{1}{\mathrm{SNR}} I \right)^{-1} . \tag{126}$$

D. Geometric Interpretation 

In the case without interference, the equation coefficients $a_1, \cdots, a_{2M}$ should be chosen in the direction of the
maximum eigenvector of $H^T H$ to minimize the effective noise (see Figure 4). When the interference is large,
the equation coefficients should instead be chosen as close to orthogonal to the effective interference as possible.
Consider the (suboptimal) rate expression in (125). The "effective" noise variance in the $m$-th stream is

$$\sigma_{\mathrm{EFFEC},m}^2 = \| (H^{-1})^T a_m \|^2 + \| J^T (H^{-1})^T a_m \|^2\, \mathrm{INR} . \tag{127}$$

Let $\lambda_{\max}$ be the maximum singular value of $H^{-1}$ and $\tilde{J} = H^{-1} J$. The effective noise variance can be bounded by

$$\sigma_{\mathrm{EFFEC},m}^2 \leq \lambda_{\max}^2 \|a_m\|^2 + \| \tilde{J}^T a_m \|^2\, \mathrm{INR} . \tag{128}$$

In the high interference regime ($\mathrm{INR} \gg 1$), the equation coefficients should be chosen orthogonal to the direction
of the "effective" interference $\tilde{J}$ to minimize the effective noise variance. This is illustrated in Figure 11. In the
case of traditional linear receivers, the equation coefficients are fixed to be the unit vectors: $a_1 = [1\ 0 \cdots 0]^T$,
$a_2 = [0\ 1 \cdots 0]^T$, ..., $a_{2M} = [0 \cdots 0\ 1]^T$. As a result, the interference space spanned by $\tilde{J}$ has significant projections
onto at least some of the decoding dimensions $a_m$. By contrast, in the case of the integer-forcing linear receiver,
since $a_1, \cdots, a_{2M}$ need only be linearly independent, we can choose all of the decoding dimensions $a_m$ to be close
to orthogonal to $\tilde{J}$.



E. Fixed Channel Example 

To illustrate the impact of choosing equation coefficients in a fashion suitable to mitigate external interference, 
we consider the 2x2 MIMO channel with channel matrix H and one-dimensional interference space J given by 

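$$\mathbf{H} = \begin{bmatrix} 2 & 1 \\ 1 & 2 \end{bmatrix}, \qquad \mathbf{J} = \begin{bmatrix} L + 2 \\ 2L + 1 \end{bmatrix} \tag{129}$$

where $L \in \mathbb{N}$. In the case of the decorrelator, we invert the channel to arrive at the effective output:

$$\tilde{\mathbf{Y}} = \mathbf{X} + \frac{1}{3}\begin{bmatrix} 2 & -1 \\ -1 & 2 \end{bmatrix}\begin{bmatrix} L + 2 \\ 2L + 1 \end{bmatrix}\mathbf{V} + \tilde{\mathbf{Z}} \tag{130}$$

$$= \mathbf{X} + \begin{bmatrix} 1 \\ L \end{bmatrix}\mathbf{V} + \tilde{\mathbf{Z}} \tag{131}$$

where $\tilde{\mathbf{Z}} = \mathbf{H}^{-1}\mathbf{Z}$. For $\mathrm{INR} \gg 1$, the effective noise variances scale as

$$\sigma^2_{\mathrm{DECORR},1} \sim \mathrm{INR} \tag{132}$$
$$\sigma^2_{\mathrm{DECORR},2} \sim L^2\,\mathrm{INR} \tag{133}$$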

Using the integer-forcing linear receiver with the choice of equations $\mathbf{a}_1 = [1\ 0]^T$ and $\mathbf{a}_2 = [-L\ 1]^T$, the effective channel output to the second decoder is

$$[-L\ \ 1]\,\tilde{\mathbf{Y}} = -L\,\mathbf{x}_1^T + \mathbf{x}_2^T + [-L\ \ 1]\,\tilde{\mathbf{Z}} \tag{134}$$

where $\mathbf{x}_m^T$ is the codeword sent by the $m$-th antenna. It follows that the effective noise variances are

$$\sigma^2_{\mathrm{INT},1} \sim \mathrm{INR} \tag{135}$$
$$\sigma^2_{\mathrm{INT},2} \sim C \tag{136}$$

where $C$ is a constant that does not scale with INR. In this example, the integer-forcing linear receiver is able to completely cancel the effect of interference in the second stream by choosing equation coefficients appropriately.
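The cancellation in this example can be checked with exact arithmetic. The sketch below is only an illustration: it assumes $\mathbf{H} = [2\ 1;\ 1\ 2]$ and $\mathbf{J} = [L+2,\ 2L+1]^T$, which is the reading of the example consistent with an effective interference direction $\mathbf{H}^{-1}\mathbf{J} = [1,\ L]^T$, and the helper names `solve_2x2` and `interference_gains` are hypothetical. It computes the inner product of each equation vector with the effective interference, which is the quantity that multiplies INR in the effective noise variance.

```python
from fractions import Fraction


def solve_2x2(H, b):
    """Solve H x = b exactly for a 2x2 integer matrix H via Cramer's rule."""
    (h11, h12), (h21, h22) = H
    det = h11 * h22 - h12 * h21
    x1 = Fraction(b[0] * h22 - h12 * b[1], det)
    x2 = Fraction(h11 * b[1] - h21 * b[0], det)
    return [x1, x2]


def interference_gains(L):
    """Return H^{-1}J and the interference gain seen by each equation vector."""
    H = [[2, 1], [1, 2]]          # assumed channel matrix
    J = [L + 2, 2 * L + 1]        # assumed interference direction
    J_eff = solve_2x2(H, J)       # effective interference H^{-1} J

    def gain(a):
        # Inner product a^T (H^{-1} J); its square multiplies INR.
        return sum(ai * ji for ai, ji in zip(a, J_eff))

    decorrelator = [gain(a) for a in ([1, 0], [0, 1])]
    integer_forcing = [gain(a) for a in ([1, 0], [-L, 1])]
    return J_eff, decorrelator, integer_forcing


if __name__ == "__main__":
    print(interference_gains(5))
```

For any $L$, the second integer-forcing equation $[-L\ 1]^T$ has zero gain, whereas the decorrelator's second stream sees a gain of $L$.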

F. Generalized Degrees of Freedom

We evaluate the generalized degrees of freedom for the $M \times M$ complex MIMO channel with $K$-dimensional interference. We specify the interference-to-noise ratio through the parameter $\alpha$ where

$$\alpha = \lim_{\substack{\mathrm{SNR} \to \infty \\ \mathrm{INR} \to \infty}} \frac{\log \mathrm{INR}}{\log \mathrm{SNR}} \tag{137}$$

and consider the case where $0 \le \alpha \le 1$. The generalized degrees of freedom are defined as follows (see [34]):

Definition 10: (Generalized Degrees-of-Freedom) For a given channel matrix $\mathbf{H}$ and interference matrix $\mathbf{J}$, the generalized degrees-of-freedom of a scheme is

$$d(\mathbf{H}, \mathbf{J}) = \lim_{\substack{\mathrm{SNR} \to \infty \\ \mathrm{INR} = \mathrm{SNR}^{\alpha}}} \frac{R(\mathrm{SNR}, \mathbf{H}, \mathbf{J})}{\log \mathrm{SNR}} \tag{138}$$

where $R(\mathrm{SNR}, \mathbf{H}, \mathbf{J})$ is the achievable sum rate of the scheme.

Definition 11: (Rational Independence) We call a matrix T*^^^ rationally independent if for all q G , we 
have that 

Tq ^ (139) 

We consider the set of matrices $(\mathbf{H}, \mathbf{J})$ such that $\mathbf{H}^{-1}\mathbf{J}$ is rationally independent. It can be seen that this set has Lebesgue measure one. In the next theorem, we show that for this class of matrices, the integer-forcing linear receiver achieves the same number of generalized degrees of freedom as the joint decoder, and is thus optimal.

Theorem 6: Consider the $M \times M$ complex MIMO channel with $K$-dimensional interference. The integer-forcing linear receiver achieves the generalized degrees of freedom

$$d_{\mathrm{INT}} = M - K\alpha \tag{140}$$

for a set of $\mathbf{H}, \mathbf{J}$ that have Lebesgue measure one.

A straightforward derivation shows that the optimal joint decoder with $2M$ streams of data achieves the following generalized degrees of freedom

$$d_{\mathrm{JOINT}} = M - K\alpha \tag{141}$$

(142) 
for all full-rank channel matrices $\mathbf{H}, \mathbf{J}$. The linear MMSE receiver with $2M$ data streams and the linear MMSE receiver with $2M - 2K$ data streams achieve the following degrees of freedom for all full-rank channel matrices $\mathbf{H}, \mathbf{J}$:

$$d_{\mathrm{MMSE},2M} = M - M\alpha \tag{143}$$

$$d_{\mathrm{MMSE},2M-2K} = M - K \tag{144}$$

When 2M data streams are transmitted (on the real-valued representation of the complex MIMO channel), the 
MMSE receiver does not achieve the optimal number of degrees of freedom since it treats the interference as noise 
at high SNR while the integer-forcing linear receiver mitigates the interference. When only 2M — 2K data streams 
are transmitted, the linear MMSE receiver can first cancel the interference and then separate the data streams to 
achieve a degree of freedom of 2M — 2K. However, this is suboptimal for all regimes a < 1 (see Figure 12). A 
straightforward calculation shows that when the number of transmitted data streams is between 2M — 2K and 2M, 
the performance is strictly suboptimal in terms of degrees of freedom. 
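The comparison between the schemes can be tabulated directly from (140), (141), (143), and (144). The following sketch simply evaluates those four expressions; the function name `gdof` and the dictionary keys are illustrative choices, not notation from the paper.

```python
def gdof(M, K, alpha):
    """Generalized degrees of freedom of each scheme, per (140)-(144).

    M: number of antennas, K: interference dimension,
    alpha: interference strength with 0 <= alpha <= 1.
    """
    return {
        "joint_ml": M - K * alpha,         # (141)
        "integer_forcing": M - K * alpha,  # (140)
        "mmse_2M": M - M * alpha,          # (143)
        "mmse_2M_minus_2K": M - K,         # (144)
    }


if __name__ == "__main__":
    # Parameters of the 16 x 16 example with 8-dimensional interference.
    for alpha in (0.0, 0.25, 0.5, 0.75, 1.0):
        print(alpha, gdof(16, 8, alpha))
```

With $M = 16$ and $K = 8$, the integer-forcing value matches the joint decoder for every $\alpha$ and is at least as large as both MMSE variants, with equality only at the endpoints $\alpha = 0$ or $\alpha = 1$.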




[Figure 12: generalized degrees of freedom versus interference strength $\alpha$. Curves: Joint ML; Integer; MMSE ($2M$ data streams); MMSE ($2M - 2K$ data streams).]

Fig. 12. Generalized degrees of freedom for the complex-valued 16 x 16 MIMO channel with 8-dimensional interference ($M = 16$, $K = 8$).



Our proof of Theorem [6] uses the following lemma. Recall that a set of vectors is linearly independent if none 
of the vectors can be written as a linear combination of the others. 

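Lemma 2: Let $\mathbf{T} \in \mathbb{R}^{2K \times (2M-2K)}$ be rationally independent and assume $|t_{i,j}| \le 1$ for all $i, j$. There exists a $Q' \in \mathbb{N}$ such that for all $Q > Q'$, there exist $2M$ linearly independent integer vectors $[\mathbf{q}_m^T, \mathbf{p}_m^T]^T \in \mathbb{Z}^{2M-2K} \times \mathbb{Z}^{2K}$ for $m = 1, 2, \ldots, 2M$, satisfying

$$\|\mathbf{q}_m\| \le C\,Q\,(\log Q)^2 \tag{145}$$

$$\|\mathbf{T}\mathbf{q}_m - \mathbf{p}_m\| \le \frac{C\,(\log Q)^2}{Q^{\frac{M-K}{K}}} \tag{146}$$

where $C$ is a constant that is independent of $Q$.

Lemma 2 bounds the approximation of $\mathbf{T}$ with $2M$ linearly independent integer vectors. Its proof is given in Appendix C. We now prove Theorem 6.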

Proof: (Theorem 6) To establish this result, we use the (generally suboptimal) choice of the matrix $\mathbf{B}$ that we have referred to as exact integer-forcing, i.e., from (125). The achievable rate of this version of the integer-forcing linear receiver is given by

$$R < \max_{\mathbf{A}:|\mathbf{A}| \neq 0}\ \min_{m}\ M \log\left(\frac{\mathrm{SNR}}{\|(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2 + \|\mathbf{J}^T(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2\,\mathrm{INR}}\right). \tag{147}$$
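With $\mathrm{INR} \sim \mathrm{SNR}^{\alpha}$, the effective noise in the worst data stream can be expressed as

$$\sigma^2 = \min_{\mathbf{A}:|\mathbf{A}| \neq 0}\ \max_{m}\ \|(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2 + \|\mathbf{J}^T(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2\,\mathrm{SNR}^{\alpha} \tag{148}$$

$$\le \min_{\mathbf{A}:|\mathbf{A}| \neq 0}\ \max_{m}\ \lambda^2_{\max}(\mathbf{H}^{-1})\|\mathbf{a}_m\|^2 + \|\tilde{\mathbf{J}}^T\mathbf{a}_m\|^2\,\mathrm{SNR}^{\alpha}, \tag{149}$$

where $\lambda_{\max}(\mathbf{H}^{-1})$ is the maximum singular value of $\mathbf{H}^{-1}$ and we use the shorthand $\tilde{\mathbf{J}} = \mathbf{H}^{-1}\mathbf{J}$. Let us partition $\tilde{\mathbf{J}}^T$ into two parts,

$$\tilde{\mathbf{J}}^T = [\mathbf{S}_1, \mathbf{S}_2], \tag{150}$$

where $\mathbf{S}_1 \in \mathbb{R}^{2K \times (2M-2K)}$ and $\mathbf{S}_2 \in \mathbb{R}^{2K \times 2K}$. Observe that with probability one, $\tilde{\mathbf{J}}^T$ has rank $2K$ (hence, is full-rank). This implies that we can permute the columns of $\tilde{\mathbf{J}}^T$ in such a way as to ensure that its last $2K$ columns are linearly independent. If we use the same permutation on the coefficients of the vector $\mathbf{a}_m$, our upper bound on the effective noise variance given in (149) will remain unchanged. Therefore, without loss of generality, we may assume that $\mathbf{S}_2$ has rank $2K$. We define $\mathbf{T} = -\mathbf{S}_2^{-1}\mathbf{S}_1$. Then, we can write

$$\mathbf{S}_2^{-1}\tilde{\mathbf{J}}^T = [\mathbf{S}_2^{-1}\mathbf{S}_1,\ \mathbf{S}_2^{-1}\mathbf{S}_2] \tag{151}$$
$$= [-\mathbf{T},\ \mathbf{I}_{2K}]. \tag{152}$$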

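Let the coefficients for the $m$-th equation be given by $\mathbf{a}_m = [\mathbf{q}_m^T, \mathbf{p}_m^T]^T$ where $\mathbf{q}_m \in \mathbb{Z}^{2M-2K}$ and $\mathbf{p}_m \in \mathbb{Z}^{2K}$. We use (152) to bound $\|\tilde{\mathbf{J}}^T\mathbf{a}_m\|$:

$$\|\tilde{\mathbf{J}}^T\mathbf{a}_m\|^2 = \|\mathbf{S}_2\mathbf{S}_2^{-1}\tilde{\mathbf{J}}^T\mathbf{a}_m\|^2 \tag{153}$$
$$= \|\mathbf{S}_2[-\mathbf{T},\ \mathbf{I}_{2K}]\mathbf{a}_m\|^2 \tag{154}$$
$$\le \lambda^2_{\max}(\mathbf{S}_2)\,\|[-\mathbf{T},\ \mathbf{I}_{2K}]\mathbf{a}_m\|^2 \tag{155}$$
$$= \lambda^2_{\max}(\mathbf{S}_2)\,\|\mathbf{T}\mathbf{q}_m - \mathbf{p}_m\|^2 \tag{156}$$

where $\lambda_{\max}(\mathbf{S}_2)$ is the maximum singular value of $\mathbf{S}_2$. Combining (156) with (149), the effective noise variance is bounded as follows:

$$\sigma^2 \le \min_{\mathbf{A}:|\mathbf{A}| \neq 0}\ \max_{m}\ \lambda^2_{\max}(\mathbf{H}^{-1})\|\mathbf{a}_m\|^2 + \lambda^2_{\max}(\mathbf{S}_2)\|\mathbf{T}\mathbf{q}_m - \mathbf{p}_m\|^2\,\mathrm{SNR}^{\alpha}. \tag{157}$$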

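We proceed to bound the quantity $\|\mathbf{T}\mathbf{q}_m - \mathbf{p}_m\|^2$. We decompose $\mathbf{T}$ into its integer and fractional parts:

$$\mathbf{T} = \mathbf{T}_I + \mathbf{T}_F \tag{158}$$

where $\mathbf{T}_I$ represents the integer part of $\mathbf{T}$ and $\mathbf{T}_F$ represents the fractional part of $\mathbf{T}$. We define

$$\tilde{\mathbf{p}}_m = \mathbf{p}_m - \mathbf{T}_I\mathbf{q}_m \tag{159}$$
$$\tilde{\mathbf{a}}_m = [\mathbf{q}_m^T, \tilde{\mathbf{p}}_m^T]^T \tag{160}$$
$$\tilde{\mathbf{A}} = [\tilde{\mathbf{a}}_1 \cdots \tilde{\mathbf{a}}_{2M}]^T \tag{161}$$
$$\mathbf{G} = \begin{bmatrix} \mathbf{I}_{2M-2K} & \mathbf{0} \\ -\mathbf{T}_I & \mathbf{I}_{2K} \end{bmatrix}. \tag{162}$$

Since $\mathbf{G}$ is a $2M \times 2M$ lower triangular matrix with non-zero diagonal elements, it has rank $2M$. We note that

$$\mathbf{a}_m = \mathbf{G}^{-1}\tilde{\mathbf{a}}_m. \tag{163}$$

Since $\mathbf{G}$ is invertible, it follows that if the matrix formed by the coefficient vectors $\mathbf{a}_1, \ldots, \mathbf{a}_{2M}$ is full-rank, then the matrix formed by $\tilde{\mathbf{a}}_1, \ldots, \tilde{\mathbf{a}}_{2M}$ is full-rank and vice versa. From (158), we have that

$$\|\mathbf{T}\mathbf{q}_m - \mathbf{p}_m\| = \|\mathbf{T}_F\mathbf{q}_m - (\mathbf{p}_m - \mathbf{T}_I\mathbf{q}_m)\| \tag{164}$$
$$= \|\mathbf{T}_F\mathbf{q}_m - \tilde{\mathbf{p}}_m\| \tag{165}$$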
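and, from (163), we have that

$$\|\mathbf{a}_m\|^2 = \|\mathbf{G}^{-1}\tilde{\mathbf{a}}_m\|^2 \tag{166}$$
$$\le \lambda^2_{\max}(\mathbf{G}^{-1})\|\tilde{\mathbf{a}}_m\|^2. \tag{167}$$

From (157), (165), and (167), the effective noise variance can be upper bounded by

$$\sigma^2 \le \min_{\mathbf{A}:|\mathbf{A}| \neq 0}\ \max_{m}\ \lambda^2_{\max}(\mathbf{H}^{-1})\lambda^2_{\max}(\mathbf{G}^{-1})\|\tilde{\mathbf{a}}_m\|^2 + \lambda^2_{\max}(\mathbf{S}_2)\|\mathbf{T}_F\mathbf{q}_m - \tilde{\mathbf{p}}_m\|^2\,\mathrm{SNR}^{\alpha}. \tag{168}$$

From Lemma 2, there exists a $Q'$ such that for all $Q > Q'$, there exist $2M$ linearly independent vectors

$$\tilde{\mathbf{a}}_m = [\mathbf{q}_m^T, \tilde{\mathbf{p}}_m^T]^T \in \mathbb{Z}^{2M-2K} \times \mathbb{Z}^{2K} \quad \text{for } m = 1, \ldots, 2M \tag{169}$$

satisfying the following two inequalities:

$$\|\mathbf{q}_m\| \le C(\log Q)^2 Q \tag{170}$$
$$\|\mathbf{T}_F\mathbf{q}_m - \tilde{\mathbf{p}}_m\| \le \frac{C(\log Q)^2}{Q^{\frac{M-K}{K}}} \tag{171}$$

where $C$ is some constant independent of $Q$. For sufficiently large $Q$, we observe that $C(\log Q)^2 / Q^{\frac{M-K}{K}} < 1$. Using (170) and (171), we bound the norm of $\tilde{\mathbf{a}}_m$ as follows: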





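$$\|\tilde{\mathbf{a}}_m\| = \sqrt{\|\mathbf{q}_m\|^2 + \|\tilde{\mathbf{p}}_m\|^2} \tag{172}$$
$$\le \|\mathbf{q}_m\| + \|\tilde{\mathbf{p}}_m\| \tag{173}$$
$$= \|\mathbf{q}_m\| + \|\tilde{\mathbf{p}}_m + \mathbf{T}_F\mathbf{q}_m - \mathbf{T}_F\mathbf{q}_m\| \tag{174}$$
$$\le \|\mathbf{q}_m\| + \|\mathbf{T}_F\mathbf{q}_m\| + \|\tilde{\mathbf{p}}_m - \mathbf{T}_F\mathbf{q}_m\| \tag{175}$$
$$\le \|\mathbf{q}_m\| + \|\mathbf{T}_F\mathbf{q}_m\| + \frac{C(\log Q)^2}{Q^{\frac{M-K}{K}}} \tag{176}$$
$$\le \|\mathbf{q}_m\| + \|\mathbf{T}_F\mathbf{q}_m\| + 1 \tag{177}$$
$$\le \|\mathbf{q}_m\| + \lambda_{\max}(\mathbf{T}_F)\|\mathbf{q}_m\| + 1 \tag{178}$$
$$\le C(\log Q)^2 Q\,(1 + \lambda_{\max}(\mathbf{T}_F)) + 1 \tag{179}$$

where $\lambda_{\max}(\mathbf{T}_F)$ is the maximum singular value of $\mathbf{T}_F$.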

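Combining (171) and (179), the effective noise variance from (168) is bounded by

$$\sigma^2 \le \lambda^2_{\max}(\mathbf{H}^{-1})\lambda^2_{\max}(\mathbf{G}^{-1})\Big(C(\log Q)^2 Q\,(1 + \lambda_{\max}(\mathbf{T}_F)) + 1\Big)^2 + \lambda^2_{\max}(\mathbf{S}_2)\,\mathrm{SNR}^{\alpha}\left(\frac{C(\log Q)^2}{Q^{\frac{M-K}{K}}}\right)^2. \tag{180}$$

Let $Q$ scale according to $Q^2 \sim \mathrm{SNR}^{\gamma}$. It follows that

$$\sigma^2 \le \Theta\big((\log \mathrm{SNR})^4\big)\left(\mathrm{SNR}^{\gamma} + \mathrm{SNR}^{\alpha - \gamma\frac{M-K}{K}}\right). \tag{181}$$

Setting $\gamma = \frac{K}{M}\alpha$, we find that the generalized degrees-of-freedom are

$$d_{\mathrm{INT}} = \lim_{\mathrm{SNR} \to \infty} \frac{2M \cdot \frac{1}{2}\log\left(\frac{\mathrm{SNR}}{\sigma^2}\right)}{\log \mathrm{SNR}} \tag{182}$$
$$\ge \lim_{\mathrm{SNR} \to \infty} \frac{M \log\left(\frac{\mathrm{SNR}}{\Theta((\log \mathrm{SNR})^4)\,\mathrm{SNR}^{\frac{K}{M}\alpha}}\right)}{\log \mathrm{SNR}} \tag{183}$$
$$= M\left(1 - \frac{K}{M}\alpha\right) \tag{184}$$
$$= M - K\alpha, \tag{185}$$

which concludes the proof of Theorem 6.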
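The choice $\gamma = \frac{K}{M}\alpha$ balances the two SNR exponents in the noise bound. As a small sanity check, the sketch below assumes that the two competing exponents are $\gamma$ and $\alpha - \gamma\frac{M-K}{K}$ (the polylogarithmic factor is ignored, since it does not affect the limit); the function names are hypothetical.

```python
from fractions import Fraction


def noise_exponents(M, K, alpha, gamma):
    """SNR exponents of the two terms in the effective noise bound."""
    return gamma, alpha - gamma * Fraction(M - K, K)


def balanced_gamma(M, K, alpha):
    """Scaling exponent gamma = (K / M) * alpha that equalizes both terms."""
    return Fraction(K, M) * alpha


def integer_forcing_gdof(M, K, alpha):
    """M * (1 - dominant noise exponent) under the balanced choice of gamma."""
    gamma = balanced_gamma(M, K, alpha)
    dominant = max(noise_exponents(M, K, alpha, gamma))
    return M * (1 - dominant)


if __name__ == "__main__":
    M, K, alpha = 4, 1, Fraction(1, 2)
    print(noise_exponents(M, K, alpha, balanced_gamma(M, K, alpha)))
    print(integer_forcing_gdof(M, K, alpha))
```

Any other $\gamma$ makes one of the two exponents larger than $\frac{K}{M}\alpha$, which would lower the resulting degrees of freedom.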
VIII. Concluding Remarks and Extensions

In this paper, we proposed a new receiver architecture for MIMO channels that bridges the performance gap 
between traditional linear receivers and the optimal joint ML decoder. We studied the scenario without coding across 
the transmit antennas and found that the proposed integer-forcing linear receiver performs close to the optimal joint 
receiver while incurring only some additional complexity over traditional linear receivers. We characterized the 
diversity-multiplexing tradeoff of the proposed architecture and showed that it is the same as that of the joint ML
decoder. We considered a generalized MIMO channel model with interference and found that the integer-forcing
receiver can be used to effectively mitigate interference in an oblivious manner. Furthermore, the proposed architecture
achieves the same generalized degrees-of-freedom as the joint ML decoder. An interesting question for future work 
is how the integer-forcing architecture behaves when coding across transmit antennas is permitted. 

Appendix A 
Integer-Forcing vs. V-BLAST IV

Recall that V-BLAST II performs decoding in the optimal order and V-BLAST III allows for rate allocation. In this appendix, we introduce V-BLAST IV, which allows for both rate allocation and an optimized decoding order. Under V-BLAST IV, the data streams are decoded with respect to the ordering

$$\pi^* = \arg\max_{\pi \in \Pi}\ \min_{m}\ 2M\,R_{\pi(m)}(\mathbf{H}) \tag{186}$$

where $R_{\pi(m)}(\mathbf{H})$ is given by (121). We compare the behavior of V-BLAST IV to that of the integer-forcing linear receiver in Figures 13, 14, and 15. The results show that although V-BLAST IV achieves good performance for low to medium SNR, the integer-forcing linear receiver achieves higher outage rates and lower outage probabilities in the medium to high SNR regime.



[Figure 13: outage rate versus SNR (dB).]

Fig. 13. 1 percent outage rates for the 2x2 complex-valued MIMO channel with Rayleigh fading.

Appendix B 
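Proof of Theorem 4

In order to establish Theorem 4, we need a few key facts about lattices.

Definition 12 (Lattice): A lattice $\Lambda \subset \mathbb{R}^{L}$ is a set of points that satisfy the following properties:

$$\text{i)}\quad \mathbf{0} \in \Lambda \tag{187}$$
$$\text{ii)}\quad \text{if } \mathbf{x}, \mathbf{y} \in \Lambda \text{ then } \mathbf{x} + \mathbf{y} \in \Lambda. \tag{188}$$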
Fig. 14. 5 percent outage rates for the 2x2 complex-valued MIMO channel with Rayleigh fading.

Fig. 15. Outage probability for the 2x2 complex-valued MIMO channel with Rayleigh fading for a target sum rate of $R = 6$.



We call the rank-$L$ matrix $\mathbf{G}$ a generator matrix for $\Lambda$ if

$$\Lambda = \{\mathbf{G}\mathbf{d} : \mathbf{d} \in \mathbb{Z}^{L}\}. \tag{189}$$

We use the definition of dual lattices from [35].

Definition 13 (Dual Lattice): Given a lattice $\Lambda \subset \mathbb{R}^{L}$ with a rank-$L$ generator matrix $\mathbf{G}$, the dual lattice $\Lambda^*$ has generator matrix $(\mathbf{G}^T)^{-1}$,

$$\Lambda^* = \{(\mathbf{G}^T)^{-1}\mathbf{d} : \mathbf{d} \in \mathbb{Z}^{L}\}. \tag{190}$$

To prove Theorem 4, we consider successive minima for the involved lattices, a standard concept from the Diophantine approximation literature (see e.g. [36]-[38]), defined as follows.

Definition 14 (Successive Minima): Let $\mathcal{B} = \{\mathbf{x} \in \mathbb{R}^{L} : \|\mathbf{x}\| \le 1\}$ be the unit ball. Given a lattice $\Lambda \subset \mathbb{R}^{L}$ with a rank-$L$ generator matrix, the $m$-th successive minimum $\epsilon_m(\Lambda)$, where $1 \le m \le L$, is given by

$$\epsilon_m(\Lambda) = \min\{\epsilon : \exists\ m \text{ linearly independent lattice points } \mathbf{v}_1, \ldots, \mathbf{v}_m \in \Lambda \cap \epsilon\mathcal{B}\}.$$
Remark 9: Definition 14 implies that $\epsilon_1(\Lambda) \le \epsilon_2(\Lambda) \le \cdots \le \epsilon_L(\Lambda)$ for any lattice $\Lambda$.

The following basic property linking the successive minima of a lattice with those of its dual lattice is key to our proof.

Lemma 3 ([35, Proposition 3.3]): Let $\Lambda \subset \mathbb{R}^{L}$ be an arbitrary lattice with a rank-$L$ generator matrix and $\Lambda^*$ be its dual lattice. The successive minima for $\Lambda$ and $\Lambda^*$ satisfy the following inequality:

$$\epsilon_m^2(\Lambda^*)\,\epsilon_1^2(\Lambda) \le \frac{m^2(m+3)}{4} \quad \text{for } m = 1, 2, \ldots, L. \tag{191}$$

Finally, we also need the following result concerning a random Gaussian lattice.

Lemma 4 ([39, Lemma 3]): Let $\mathbf{H} \in \mathbb{R}^{2N \times 2M}$ be the real-valued decomposition of an $N \times M$ complex Gaussian matrix with i.i.d. Rayleigh entries. Let $\Lambda = \{\mathbf{H}\mathbf{d} : \mathbf{d} \in \mathbb{Z}^{2M}\}$ be the lattice generated by $\mathbf{H}$. Then

$$\Pr(\epsilon_1(\Lambda) \le s) \le \begin{cases} \gamma\, s^{2N}, & M < N, \\ \delta\, s^{2N} \max\{-(\ln s)^{N+1}, 1\}, & M = N, \end{cases}$$

where $\gamma$ and $\delta$ are constants independent of $s$.

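Proof: (Theorem 4). Let $R = r \log \mathrm{SNR}$ be the target rate where $r \in [0, M]$. For a fixed set of equations $\mathbf{A} = [\mathbf{a}_1, \cdots, \mathbf{a}_{2M}]^T$ and a fixed preprocessing matrix $\mathbf{B} = [\mathbf{b}_1, \cdots, \mathbf{b}_{2M}]^T$ the outage probability is given by

$$\rho_{\mathrm{OUT}}(r, \mathbf{A}, \mathbf{B}) = \Pr\big(R(\mathbf{H}, \mathbf{A}, \mathbf{B}) < r \log \mathrm{SNR}\big)$$
$$= \Pr\Big(\min_{m} R(\mathbf{H}, \mathbf{a}_m, \mathbf{b}_m) < \frac{r}{2M} \log \mathrm{SNR}\Big)$$
$$= \Pr\Big(\max_{m}\ \|\mathbf{b}_m\|^2 + \mathrm{SNR}\,\|\mathbf{H}^T\mathbf{b}_m - \mathbf{a}_m\|^2 > \mathrm{SNR}^{1 - \frac{r}{M}}\Big).$$

For a fixed set of equations $\mathbf{A}$, we are free to choose any projection matrix $\mathbf{B}$, resulting in the following bound:

$$\rho_{\mathrm{OUT}}(r, \mathbf{A}) = \min_{\mathbf{B}} \rho_{\mathrm{OUT}}(r, \mathbf{A}, \mathbf{B})$$
$$\le \rho_{\mathrm{OUT}}(r, \mathbf{A}, \mathbf{A}\mathbf{H}^{-1})$$
$$= \Pr\Big(\max_{m}\ \|(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2 > \mathrm{SNR}^{1 - \frac{r}{M}}\Big). \tag{192}$$

We then choose the best set of full-rank equations by optimizing (192) over all integer matrices $\mathbf{A} \in \mathbb{Z}^{2M \times 2M}$ with non-zero determinant:

$$\rho_{\mathrm{OUT}}(r) = \min_{\mathbf{A}:|\mathbf{A}| > 0} \rho_{\mathrm{OUT}}(r, \mathbf{A})$$
$$\le \min_{\mathbf{A}:|\mathbf{A}| > 0} \Pr\Big(\max_{m}\ \|(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2 > \mathrm{SNR}^{1 - \frac{r}{M}}\Big) \tag{193}$$
$$= \Pr\Big(\min_{\mathbf{A}:|\mathbf{A}| > 0}\ \max_{m}\ \|(\mathbf{H}^{-1})^T\mathbf{a}_m\|^2 > \mathrm{SNR}^{1 - \frac{r}{M}}\Big). \tag{194}$$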



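We use properties of dual lattices to bound (194). For a fixed $\mathbf{H}$, let $\Lambda_{\mathrm{CHANNEL}}$ be the lattice generated by $\mathbf{H}$ and $\Lambda_{\mathrm{DUAL}}$ be the dual lattice generated by $(\mathbf{H}^T)^{-1}$:

$$\Lambda_{\mathrm{CHANNEL}} = \{\mathbf{H}\mathbf{d} : \mathbf{d} \in \mathbb{Z}^{2M}\} \tag{195}$$
$$\Lambda_{\mathrm{DUAL}} = \{(\mathbf{H}^T)^{-1}\mathbf{d} : \mathbf{d} \in \mathbb{Z}^{2M}\}. \tag{196}$$

Using the definition of successive minima (Definition 14), it follows that

$$\min_{\mathbf{A}:|\mathbf{A}| > 0}\ \max_{m}\ \|(\mathbf{H}^T)^{-1}\mathbf{a}_m\| = \max_{m = 1, \cdots, 2M} \epsilon_m(\Lambda_{\mathrm{DUAL}}) \tag{197}$$
$$= \epsilon_{2M}(\Lambda_{\mathrm{DUAL}}). \tag{198}$$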
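We now express (194) in terms of the successive minima of $\Lambda_{\mathrm{DUAL}}$:

$$\rho_{\mathrm{OUT}}(r) \le \Pr\Big(\min_{\mathbf{A}:|\mathbf{A}| > 0}\ \max_{m}\ \|(\mathbf{H}^T)^{-1}\mathbf{a}_m\|^2 > \mathrm{SNR}^{1 - \frac{r}{M}}\Big) \tag{199}$$
$$= \Pr\Big(\epsilon^2_{2M}(\Lambda_{\mathrm{DUAL}}) > \mathrm{SNR}^{1 - \frac{r}{M}}\Big). \tag{200}$$

Using Lemma 3, we can bound the successive minima of $\Lambda_{\mathrm{DUAL}}$ in terms of the successive minima of $\Lambda_{\mathrm{CHANNEL}}$:

$$\epsilon^2_{2M}(\Lambda_{\mathrm{DUAL}}) \le \frac{2M^3 + 3M^2}{\epsilon_1^2(\Lambda_{\mathrm{CHANNEL}})}. \tag{201}$$

Combining (200) and (201), the outage probability is upper bounded by

$$\rho_{\mathrm{OUT}}(r) \le \Pr\left(\frac{2M^3 + 3M^2}{\epsilon_1^2(\Lambda_{\mathrm{CHANNEL}})} > \mathrm{SNR}^{1 - \frac{r}{M}}\right) \tag{202}$$
$$= \Pr\left(\epsilon_1^2(\Lambda_{\mathrm{CHANNEL}}) < \frac{2M^3 + 3M^2}{\mathrm{SNR}^{1 - \frac{r}{M}}}\right). \tag{203}$$

This probability can in turn be upper bounded using Lemma 4. For large SNR, we find that

$$\rho_{\mathrm{OUT}}(r) \le \frac{\max\{\gamma, \delta\}\,(2M^3 + 3M^2)^{N}\,(\ln \mathrm{SNR})^{N+1}}{\mathrm{SNR}^{N\left(1 - \frac{r}{M}\right)}}$$

where $\gamma, \delta$ are constants independent of SNR. The achievable diversity for multiplexing gain $r$ is thus

$$d(r) = \lim_{\mathrm{SNR} \to \infty} \frac{-\log \rho_{\mathrm{OUT}}(r)}{\log \mathrm{SNR}} \tag{204}$$
$$\ge \lim_{\mathrm{SNR} \to \infty} \frac{N\left(1 - \frac{r}{M}\right)\log \mathrm{SNR} - (N+1)\log \ln \mathrm{SNR} - \log\big(\max\{\gamma, \delta\}(2M^3 + 3M^2)^{N}\big)}{\log \mathrm{SNR}} \tag{205}$$
$$= N\left(1 - \frac{r}{M}\right). \tag{206}$$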
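Two small pieces of arithmetic in this proof can be verified mechanically. The sketch below assumes that the bound of Lemma 3 reads $m^2(m+3)/4$, which is the reading consistent with the constant $2M^3 + 3M^2$ in (201) when $m = 2M$, and evaluates the resulting diversity $d(r) = N(1 - r/M)$ from (206). The function names are illustrative.

```python
from fractions import Fraction


def lemma3_constant(m):
    """Assumed right-hand side of Lemma 3: m^2 (m + 3) / 4."""
    return Fraction(m * m * (m + 3), 4)


def diversity(N, M, r):
    """Diversity order d(r) = N (1 - r / M) for multiplexing gain 0 <= r <= M."""
    return N * (1 - Fraction(r) / M)


if __name__ == "__main__":
    for M in range(1, 6):
        # With m = 2M the constant reduces to 2M^3 + 3M^2, as used in (201).
        print(M, lemma3_constant(2 * M), 2 * M**3 + 3 * M**2)
    print(diversity(2, 2, 1))
```

The diversity expression is linear in $r$, falling from $N$ at $r = 0$ to $0$ at $r = M$.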



Appendix C 
Proof of Lemma [2] 

In order to prove Lemma 2, we use a technique introduced in [40]. We first construct semi-norms $f : \mathbb{R}^{2M} \to \mathbb{R}$ and $g : \mathbb{R}^{2M} \to \mathbb{R}$ as well as a norm $h : \mathbb{R}^{2M} \to \mathbb{R}$. We then apply Minkowski's Theorem (see Theorem 7 below) to find $2M$ linearly independent integer vectors that achieve the successive minima (with respect to the norm $h$). Afterwards, we will show that these integer vectors satisfy the conditions in Lemma 2. We will need the following definitions and theorems in the proof.

Definition 15 ($h$-Unit Ball): Let $h : \mathbb{R}^{2M} \to \mathbb{R}$ be a norm. The unit ball with respect to $h$ (or $h$-unit ball) is denoted by

$$\mathcal{B}_h = \{\mathbf{x} \in \mathbb{R}^{2M} : h(\mathbf{x}) \le 1\}. \tag{207}$$

The volume of $\mathcal{B}_h$ is denoted by $V_h$.

Definition 16 (Successive $h$-Minima): Let $h : \mathbb{R}^{2M} \to \mathbb{R}$ be a norm and $\mathcal{B}_h$ be the $h$-unit ball. The $m$-th successive $h$-minimum $\epsilon_m$ is given by

$$\epsilon_m = \min\{\epsilon : \exists\ m \text{ linearly independent integer points } \mathbf{v}_1, \ldots, \mathbf{v}_m \in \mathbb{Z}^{2M} \cap \epsilon\mathcal{B}_h\}$$

for $1 \le m \le 2M$.

Theorem 7 (Minkowski): For any norm $h : \mathbb{R}^{2M} \to \mathbb{R}$, the successive $h$-minima satisfy

$$V_h \prod_{i=1}^{2M} \epsilon_i \le 2^{2M}. \tag{208}$$
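Theorem 8 (Dirichlet): For any $\mathbf{T} \in \mathbb{R}^{2K \times (2M-2K)}$ and any $Q \in \mathbb{N}$, there exists a $[\mathbf{q}^T, \mathbf{p}^T]^T \in \mathbb{Z}^{2M-2K} \times \mathbb{Z}^{2K} \setminus \{\mathbf{0}\}$ such that

$$\|\mathbf{q}\|_\infty \le Q \tag{209}$$
$$\|\mathbf{T}\mathbf{q} - \mathbf{p}\|_\infty < \frac{1}{Q^{\frac{2M-2K}{2K}}}. \tag{210}$$

Theorem 9 (Khinchin-Groshev): Fix a function $\Psi : \mathbb{N} \to \mathbb{R}^{+}$. If

$$\sum_{q=1}^{\infty} q^{2M-2K-1}\,(\Psi(q))^{2K} < \infty \tag{211}$$

then, for almost all $\mathbf{T} \in \mathbb{R}^{2K \times (2M-2K)}$ satisfying $|t_{i,j}| \le 1$ for all $i, j$, there are only finitely many solutions $[\mathbf{q}^T, \mathbf{p}^T]^T \in \mathbb{Z}^{2M-2K} \times \mathbb{Z}^{2K}$ to the inequality

$$\|\mathbf{T}\mathbf{q} - \mathbf{p}\|_\infty < \Psi(\|\mathbf{q}\|_\infty). \tag{212}$$

Theorem 7 can be found in [38, Equation 1.1]. Theorems 8 and 9 can be found in many standard Diophantine approximation texts (see, for instance, [36]).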
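To build intuition for Theorem 8, the following sketch brute-forces the classical one-dimensional version of Dirichlet's approximation theorem: for a real number $t$ and a positive integer $Q$, there are integers $1 \le q \le Q$ and $p$ with $|tq - p| < 1/Q$. This scalar case is only an illustration and is not the matrix setting used in the proof; the function name `dirichlet_scalar` is hypothetical, and the search uses floating-point arithmetic.

```python
import math


def dirichlet_scalar(t, Q):
    """Search for integers (q, p), 1 <= q <= Q, with |t*q - p| < 1/Q.

    Returns the first pair found, or None if the floating-point search
    fails to find one.
    """
    for q in range(1, Q + 1):
        p = round(t * q)  # nearest integer to t*q
        if abs(t * q - p) < 1.0 / Q:
            return q, p
    return None


if __name__ == "__main__":
    for Q in (5, 10, 100):
        print(Q, dirichlet_scalar(math.sqrt(2), Q))
```

As $Q$ grows, the guaranteed approximation error shrinks like $1/Q$, which mirrors the role of the exponent $\frac{2M-2K}{2K}$ in (210).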

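Proof: (Lemma 2). For any vector $\mathbf{v} \in \mathbb{R}^{2M}$, we denote the first $2M - 2K$ components by $\mathbf{q}$ and the remaining $2K$ components by $\mathbf{p}$, and will thus write

$$\mathbf{v} = [\mathbf{q}^T, \mathbf{p}^T]^T. \tag{213}$$

From the lemma statement, $\mathbf{T} = [\mathbf{t}_1 \cdots \mathbf{t}_{2K}]^T$ is a (rationally independent) $2K \times (2M - 2K)$ real-valued matrix with $|t_{i,j}| \le 1$ for all $i, j$. For a fixed $\mathbf{T}$, we define the semi-norms $f, g$ as follows:

$$f(\mathbf{v}) = \|\mathbf{T}\mathbf{q} - \mathbf{p}\| \tag{214}$$
$$g(\mathbf{v}) = \|\mathbf{q}\|. \tag{215}$$

For a fixed $Q > 2M$, we let $\lambda_1$ denote the minimum value of $f(\mathbf{v})$ under the constraint $\|\mathbf{q}\| \le Q$,

$$\lambda_1 = \min_{g(\mathbf{v}) \le Q} f(\mathbf{v}) \tag{216}$$
$$= \min_{\|\mathbf{q}\| \le Q}\ \min_{[\mathbf{q}^T, \mathbf{p}^T]^T \neq \mathbf{0}} \|\mathbf{T}\mathbf{q} - \mathbf{p}\|, \tag{217}$$

let $\mathbf{q}_1 \in \mathbb{Z}^{2M-2K}$ denote the integer vector that achieves $\lambda_1$,

$$\mathbf{q}_1 = \arg\min_{\|\mathbf{q}\| \le Q}\ \min_{[\mathbf{q}^T, \mathbf{p}^T]^T \neq \mathbf{0}} \|\mathbf{T}\mathbf{q} - \mathbf{p}\|, \tag{218}$$

and let $\mu_1$ be the value of $\mathbf{q}_1$ evaluated by $g$,

$$\mu_1 = g(\mathbf{q}_1) \tag{219}$$
$$= \|\mathbf{q}_1\|. \tag{220}$$

Based on the semi-norms $f$ and $g$, we define the function $h : \mathbb{R}^{2M} \to \mathbb{R}$ as follows:

$$h(\mathbf{v}) = \left(f^2(\mathbf{v}) + \frac{\lambda_1^2}{\mu_1^2}\, g^2(\mathbf{v})\right)^{1/2} \tag{221}$$
$$= \left(\|\mathbf{T}\mathbf{q} - \mathbf{p}\|^2 + \frac{\lambda_1^2}{\mu_1^2}\|\mathbf{q}\|^2\right)^{1/2}. \tag{222}$$

In the sequel, we show that $h$ is a norm for $Q > 1$. We define the $2M \times 2M$ matrix $\mathbf{\Gamma}$ as follows:

$$\mathbf{\Gamma} = \begin{bmatrix} \mathbf{T} & -\mathbf{I}_{2K} \\ \frac{\lambda_1}{\mu_1}\mathbf{I}_{2M-2K} & \mathbf{0} \end{bmatrix}.$$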
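Note that we can rewrite the function $h$ using $\mathbf{\Gamma}$:

$$h(\mathbf{v}) = \|\mathbf{\Gamma}\mathbf{v}\|. \tag{223}$$

Since exchanging rows of a matrix only affects the sign of its determinant, we have that

$$|\det(\mathbf{\Gamma})| = \left|\det \begin{bmatrix} \frac{\lambda_1}{\mu_1}\mathbf{I}_{2M-2K} & \mathbf{0} \\ \mathbf{T} & -\mathbf{I}_{2K} \end{bmatrix}\right|. \tag{224}$$

Now we use the fact that the determinant of a lower triangular matrix is just the product of its diagonal entries,

$$|\det(\mathbf{\Gamma})| = \left(\frac{\lambda_1}{\mu_1}\right)^{2M-2K}. \tag{225}$$

Consider the case where $Q > 1$. Since $\mathbf{T}$ is rationally independent, it follows that $\lambda_1 > 0$. Since $\mu_1 > 0$ by definition, we have that $\frac{\lambda_1}{\mu_1} > 0$. Since $\mathbf{\Gamma}$ is full-rank and thus injective, $h$ is a norm.

Let $V_h$ be the volume of the $h$-unit ball. Let $\mathbf{u} = \mathbf{\Gamma}\mathbf{v}$. It follows that:

$$V_h = \int_{\{\mathbf{v} : \|\mathbf{\Gamma}\mathbf{v}\| \le 1\}} d\mathbf{v} \tag{226}$$
$$= \int_{\{\mathbf{u} : \|\mathbf{u}\| \le 1\}} |\det(\mathbf{\Gamma}^{-1})|\, d\mathbf{u} \tag{227}$$
$$= \frac{1}{|\det(\mathbf{\Gamma})|} \int_{\{\mathbf{u} : \|\mathbf{u}\| \le 1\}} d\mathbf{u} \tag{228}$$
$$= \frac{1}{|\det(\mathbf{\Gamma})|}\, V_{2M} \tag{229}$$
$$= \left(\frac{\mu_1}{\lambda_1}\right)^{2M-2K} V_{2M}, \tag{230}$$

where $V_{2M}$ is the volume of the unit ball with respect to the Euclidean norm (in $2M$-dimensional space).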

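Let $\epsilon_1, \ldots, \epsilon_{2M}$ be the successive minima with respect to $h$ (see Definition 16). Let $\mathbf{y}_1, \ldots, \mathbf{y}_{2M} \in \mathbb{Z}^{2M}$ be the linearly independent integer points that achieve the successive minima, i.e.:

$$h(\mathbf{y}_i) = \epsilon_i. \tag{231}$$

Using Minkowski's Theorem on successive minima (Theorem 7), we have that:

$$V_h \prod_{i=1}^{2M} \epsilon_i \le 2^{2M} \tag{232}$$

where $V_h$ is the volume of the $h$-unit ball. Using (230), we have that:

$$\left(\frac{\mu_1}{\lambda_1}\right)^{2M-2K} V_{2M} \prod_{i=1}^{2M} \epsilon_i \le 2^{2M}. \tag{233}$$

Rewriting the above, we have that

$$\prod_{i=1}^{2M} \epsilon_i \le C \left(\frac{\lambda_1}{\mu_1}\right)^{2M-2K} \tag{234}$$

where $C$ is a constant that depends only on $2M$. Rearranging (234), we arrive at the following:

$$\epsilon_{2M} \le C \left(\frac{1}{\epsilon_1 \cdots \epsilon_{2M-1}}\right)\left(\frac{\lambda_1^{2M-2K}}{\mu_1^{2M-2K}}\right) \tag{235}$$
$$= C \left(\frac{1}{\epsilon_1 \cdots \epsilon_{2M-2K}}\right)\left(\frac{1}{\epsilon_{2M-2K+1} \cdots \epsilon_{2M-1}}\right)\left(\frac{\lambda_1^{2M-2K}}{\mu_1^{2M-2K}}\right) \tag{236}$$
$$= C \left(\frac{\lambda_1}{\epsilon_1} \cdots \frac{\lambda_1}{\epsilon_{2M-2K}}\right)\left(\frac{1}{\epsilon_{2M-2K+1} \cdots \epsilon_{2M-1}}\right)\left(\frac{1}{\mu_1^{2M-2K}}\right) \tag{237}$$
$$= C \left(\frac{\lambda_1}{\epsilon_1} \cdots \frac{\lambda_1}{\epsilon_{2M-1}}\right)\left(\frac{1}{\mu_1^{2M-2K}\lambda_1^{2K-1}}\right). \tag{238}$$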



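For all $\mathbf{v} \in \mathbb{Z}^{2M} \setminus \{\mathbf{0}\}$, we have that $h(\mathbf{v}) \ge \lambda_1$. To see this, we can consider the cases $\|\mathbf{q}\| < \mu_1$ and $\|\mathbf{q}\| \ge \mu_1$ separately. When $\|\mathbf{q}\| \ge \mu_1$, $h$ can be bounded as follows:

$$h(\mathbf{v}) = \left(\|\mathbf{T}\mathbf{q} - \mathbf{p}\|^2 + \frac{\lambda_1^2}{\mu_1^2}\|\mathbf{q}\|^2\right)^{1/2} \tag{239}$$
$$\ge \frac{\lambda_1}{\mu_1}\|\mathbf{q}\| \tag{240}$$
$$\ge \lambda_1. \tag{241}$$

We now consider the case where $\|\mathbf{q}\| < \mu_1$. We first bound $h$ as follows:

$$h(\mathbf{v}) = \left(\|\mathbf{T}\mathbf{q} - \mathbf{p}\|^2 + \frac{\lambda_1^2}{\mu_1^2}\|\mathbf{q}\|^2\right)^{1/2} \tag{242}$$
$$\ge \|\mathbf{T}\mathbf{q} - \mathbf{p}\|. \tag{243}$$

Recall that $\mu_1 = \|\mathbf{q}_1\|$ and $\lambda_1 = \min_{\mathbf{p} \in \mathbb{Z}^{2K}} \|\mathbf{T}\mathbf{q}_1 - \mathbf{p}\|$. If there existed a $\mathbf{q}$ with $\|\mathbf{q}\| < \|\mathbf{q}_1\|$ such that

$$\min_{\mathbf{p}} \|\mathbf{T}\mathbf{q} - \mathbf{p}\| < \min_{\mathbf{p}} \|\mathbf{T}\mathbf{q}_1 - \mathbf{p}\| \tag{244}$$
$$= \lambda_1, \tag{245}$$

then the definition of $\mathbf{q}_1$ in (218) would be violated. Hence, in this case, $h(\mathbf{v}) \ge \|\mathbf{T}\mathbf{q} - \mathbf{p}\| \ge \lambda_1$. Since $\epsilon_j = h(\mathbf{y}_j)$ for some $\mathbf{y}_j \in \mathbb{Z}^{2M}$, it follows that

$$\epsilon_j \ge \lambda_1 \quad \text{for all } j = 1, \ldots, 2M. \tag{246}$$

Combining the above with (238), it follows that

$$\epsilon_{2M} \le \frac{C}{\mu_1^{2M-2K}\lambda_1^{2K-1}}. \tag{247}$$

From the definition of $h$, $\epsilon_{2M}$, and $\mathbf{y}_1, \ldots, \mathbf{y}_{2M}$, we have that

$$h(\mathbf{y}_j) \le \epsilon_{2M} \quad \text{for } j = 1, \ldots, 2M. \tag{248}$$

By the construction of $h$ (see (221)), we have that:

$$f(\mathbf{y}_j) \le h(\mathbf{y}_j) \le \epsilon_{2M} \le \frac{C}{\lambda_1^{2K-1}\mu_1^{2M-2K}} \tag{249}$$
$$\frac{\lambda_1}{\mu_1}\, g(\mathbf{y}_j) \le h(\mathbf{y}_j) \le \epsilon_{2M} \le \frac{C}{\lambda_1^{2K-1}\mu_1^{2M-2K}} \tag{250}$$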
for $j = 1, \ldots, 2M$. The above equations imply that

$$f(\mathbf{y}_j) \le \frac{C\,\lambda_1}{\lambda_1^{2K}\mu_1^{2M-2K}} \tag{251}$$
$$g(\mathbf{y}_j) \le \frac{C\,\mu_1}{\lambda_1^{2K}\mu_1^{2M-2K}} \tag{252}$$

for $j = 1, \ldots, 2M$.

Recall that $\lambda_1, \mu_1$ are defined with respect to a fixed $Q$. We now show that for sufficiently large $Q$,

$$\lambda_1(Q)^{2K}\,\mu_1(Q)^{2M-2K} > \frac{1}{(\log \mu_1(Q))^2}. \tag{253}$$
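We define the function $\Psi : \mathbb{Z} \to \mathbb{R}$ as follows:

$$\Psi(q) = 1 \quad \text{for } q = 1 \tag{254}$$
$$\Psi(q) = \frac{1}{q^{\frac{2M-2K}{2K}}\,(\log q)^{\frac{2}{2K}}} \quad \text{for } q > 1. \tag{255}$$

We note that with this choice of $\Psi$, it follows that

$$\sum_{q=1}^{\infty} q^{2M-2K-1}\,\Psi(q)^{2K} < \infty. \tag{256}$$

Applying Theorem 9, we have that for rationally independent $\mathbf{T} \in \mathbb{R}^{2K \times (2M-2K)}$, there are only a finite number of integer solutions $[\mathbf{q}^T, \mathbf{p}^T]^T \in \mathbb{Z}^{2M-2K} \times \mathbb{Z}^{2K}$ that satisfy the following condition:

$$\|\mathbf{T}\mathbf{q} - \mathbf{p}\|_\infty < \frac{1}{\|\mathbf{q}\|_\infty^{\frac{2M-2K}{2K}}\,(\log \|\mathbf{q}\|_\infty)^{\frac{2}{2K}}}. \tag{257}$$

We rewrite this condition as follows:

$$\|\mathbf{q}\|_\infty^{2M-2K}\,\|\mathbf{T}\mathbf{q} - \mathbf{p}\|_\infty^{2K} < \frac{1}{(\log \|\mathbf{q}\|_\infty)^2}. \tag{258}$$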

We first fix a rationally independent $\mathbf{T} \in \mathbb{R}^{2K \times (2M-2K)}$. Recall from (218) that $\mathbf{q}_1(Q)$ is the integer vector that achieves $\lambda_1(Q)$ for a given $Q$. Clearly, $\{\|\mathbf{q}_1(Q)\|_\infty\}_Q$ is a non-decreasing integer sequence (in $Q$). Assume that $\|\mathbf{q}_1(Q)\|_\infty$ is unbounded as $Q \to \infty$. By Theorem 9, we know that there are only a finite number of integer vectors $\mathbf{q}$ that satisfy the condition in (258). Let $\mathbf{q}'$ be the integer vector with the largest $\ell_\infty$ norm that satisfies the condition in (258). This implies that for all $Q$ where $\|\mathbf{q}_1(Q)\|_\infty > \|\mathbf{q}'\|_\infty$, $\mathbf{q}_1(Q)$ does not satisfy the condition in (258). Since $\{\|\mathbf{q}_1(Q)\|_\infty\}_Q$ is an unbounded non-decreasing sequence, there exists some $Q'$ such that for all $Q > Q'$, $\mathbf{q}_1(Q)$ does not satisfy the condition in (258). Note that (253) follows since any $\mathbf{q}, \mathbf{p}$ that satisfies

$$\|\mathbf{q}\|_\infty^{2M-2K}\,\|\mathbf{T}\mathbf{q} - \mathbf{p}\|_\infty^{2K} > \frac{1}{(\log \|\mathbf{q}\|_\infty)^2} \tag{259}$$

also satisfies

$$\|\mathbf{q}\|^{2M-2K}\,\|\mathbf{T}\mathbf{q} - \mathbf{p}\|^{2K} > \frac{1}{(\log \|\mathbf{q}\|)^2}. \tag{260}$$

Finally, for any rationally independent $\mathbf{T} \in \mathbb{R}^{2K \times (2M-2K)}$, we prove that the sequence $\{\|\mathbf{q}_1(Q)\|_\infty\}_Q$ is unbounded as $Q \to \infty$. We prove this by contradiction. That is, assume that there exists some $C \in \mathbb{Z}^{+}$ such that $\|\mathbf{q}_1(Q)\|_\infty \le C$ for all $Q$. This implies that $\mathbf{q}_1(Q)$ takes only a finite set of values. Hence, there exists a $C' > 0$ such that

$$\min_{\mathbf{p}} \|\mathbf{T}\mathbf{q}_1(Q) - \mathbf{p}\|_\infty > C' \tag{261}$$

for all $Q$. However, by definition of $\lambda_1(Q)$ and Dirichlet's theorem (Theorem 8) we have that

$$\min_{\mathbf{p}} \|\mathbf{T}\mathbf{q}_1(Q) - \mathbf{p}\|_\infty < \frac{1}{Q^{\frac{2M-2K}{2K}}} \tag{262}$$
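for all $Q \in \mathbb{N}$. This results in a contradiction with our assumption. Dirichlet's Theorem (Theorem 8) is stated in terms of the $\ell_\infty$ norm and $\lambda_1$ is defined in terms of the $\ell_2$ norm. Using the fact that

$$\lambda_1 = \min_{\|\mathbf{q}\| \le Q}\ \min_{[\mathbf{q}^T, \mathbf{p}^T]^T \neq \mathbf{0}} \|\mathbf{T}\mathbf{q} - \mathbf{p}\| \tag{263}$$
$$\le \sqrt{2K} \min_{\|\mathbf{q}\| \le Q}\ \min_{[\mathbf{q}^T, \mathbf{p}^T]^T \neq \mathbf{0}} \|\mathbf{T}\mathbf{q} - \mathbf{p}\|_\infty, \tag{264}$$

and (262), we have that

$$\lambda_1 \le \frac{\sqrt{2K}}{Q^{\frac{2M-2K}{2K}}}. \tag{265}$$

By definition, we have that

$$\mu_1 \le Q. \tag{266}$$

Using (251), (253), (265), and (266), and assuming that $Q$ is sufficiently large, we bound $f(\mathbf{y}_j)$ as follows:

$$f(\mathbf{y}_j) \le \frac{C\,\lambda_1}{\lambda_1^{2K}\mu_1^{2M-2K}} \tag{267}$$
$$\le C\,\lambda_1 (\log \mu_1)^2 \tag{268}$$
$$\le \frac{C\sqrt{2K}\,(\log Q)^2}{Q^{\frac{2M-2K}{2K}}} \tag{269}$$
$$\le \frac{C'(\log Q)^2}{Q^{\frac{M-K}{K}}}, \tag{270}$$

where $C'$ is a constant that does not depend on $Q$. Similarly, we can bound $g(\mathbf{y}_j)$ as follows:

$$g(\mathbf{y}_j) \le C\,\mu_1 (\log \mu_1)^2 \le C'\,Q\,(\log Q)^2, \tag{271}$$

which concludes the proof of Lemma 2. ■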